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SPELLING ERRORS AND THE SERIAL-POSITION EFFECT’ 


ARTHUR R. JENSEN 


University of California 


of spelling errors as a function of letter 


9.. and 11-letter words selected 


The relative frequencies 
position have been examined for 7- 
at random from the Thorndike-Lorge word list. These were admin- 
istered to 150 8th graders, 158 10th graders, and 89 junior college 
freshmen, respectively. The distribution of errors according to letter 
position was found to closely approximate the classical, skewed, bow- 


; 


shaped, serial-position curve for errors generally found in serial rote 
learning. Other features in common between spelling and serial 
learning were discussed, It is suggested that a theory of serial learning 
and of the serial-position effect may be germane to the psychology of 


spelling. 


Perusal of the research on spelling 
reveals that this subject has been tra- 
ditionally dominated more by a linguis- 
tic than by a psychological orientation 

Horn, 1960). Spelling difficulties have 
been analyzed largely in terms of pho- 
netics, syllabication, parts of speech, and 
frequency of word usage. Spelling re- 
search is also predominantly character- 
ized by a rather simple, unanalytical, 
correlational approach. Relationships 
have been found between measures of 
spelling ability and measures of reading, 
MA, IQ, achievement in various school 
subjects, and numerous other variables 
(Horn, 1960 


spelling studied as a learning phenome- 


Seldom, however, is 


non or examined in terms of principles 
derived from the experimental psychol- 
ogy of learning. The investigator has 
conceived of what might be called a 


psychological theory of spelling based 


1This study was supported in part by a 
Faculty Research Grant from the University 
of California and in part by the National 
Science Foundation. 


primarily on concepts in learning theory. 
Che purpose of the present investigation 
is to examine only one aspect of the 
formulation, viz., the serial-position ef- 
fect. The similarity of the distribution 
of spelling errors, when errors are 
plotted according to their position in the 
word, to the distribution of errors in 
serial rote learning—the so-called serial- 
position effect—has never before been 
The 


pelling and serial learning would seem 


noted correspondence between 
to offer some clues concerning the na- 
ture of spelling errors. 

When a series of words, nonsense syl- 
ables, symbols, or figures is learned, the 
‘rrois made before mastery are distrib- 
uted according to the position of each 
item in the series, producing the famil- 
iar, skewed, bow-shaped serial-position 
curve. Various theories have been pro- 
posed to account for this phenomenon 
Jensen, 1962a; McGeoch & Irion, 
1952, pp. 125-134). Since a word con- 
sists of a series of letters, and since spell- 
ing a word consists of putting each let- 
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ter in its proper order, we might ask 
whether the serial position effect, which 
is generally manifested in every form of 
serial learning investigated in the labo- 
ratory, is also manifested in spelling. If 
so, the distribution of spelling errors 
should be similar to the serial-position 
curves produced by the serial learning 
of other materials. That is to say, errors 
should generally be most frequent in 
the middle of a word, with fewest errors 
at the beginning and end of the word. 
That spelling errors occur more fre- 
quently in the middle of words has 
probably been noted before, but appar- 
ently not with sufficient precision to 
reveal the more subtle features of the 
serial-position effect. 

Variations in the shape of the serial- 
position curve are mainly a function of 
the number of items in the series. The 
curve for a short series is markedly 


skewed, with the largest proportion of 
errors occurring well past the middle of 


the series, As the series becomes longer, 
the serial-position curve becomes more 
symmetrical (McGeoch & Irion, 1952, 
p. 123). It would be interesting indeed 
if this were found to be the case also for 
the distribution of spelling errors in 
words of different lengths. The present 
investigation was addressed to this ques- 
tion. 


METHOD 
Spelling 


So that word selection would be unbiased 
with respect to the purpose of the present 
study, every 7-letter, 9-letter, and 11-letter 
word of the Thorndike-Lorge word list 
(1944, pp. 1-208) was typed on a slip of 
paper. The slips were thoroughly shuffled to 
eliminate alphabetical order, and then 75 
eight-item word lists were made up separately 
for 7-, 9-, and 11-letter words by drawing 
the words at random from the total word 
pool. Foreign words and proper nouns were 
eliminated. 

In order to maximize spelling errors while 
at the same time presenting words that were 
reasonably familiar to the subjects, it was 


decided to give the 7-letter words only to the 
eighth graders (N = 150), the 9-letter words 
only to tenth graders (N = 148), and the 
11-letter words only to junior college fresh- 
men (N = 89 

The spelling tests were selected at random 
from all the eight-item spelling lists. Every 
eighth grader was given 24 7-letter words; 
every tenth grader was given 24 9-letter 
words, but different lists were used in differ- 
ent Classes, so the total number of different 
9-letter words was 96. Each of the junior 
college students was given 48 11-letter words 

The subjects had mimeographed forms on 
which to print the words. For a 7-letter 
word, for example, a row of seven adjacent 
squares was provided, with the instructions to 
put one letter in each square, Similar forms 
were used for 9- and 11-letter words. This 
procedure permitted the position of each 
spelling error to be easily tabulated. 

For all groups of subjects, the words were 
read aloud by the regular classroom teacher 
in the fashion of the usual spelling test. Stu- 
dents were not given the words for study 
before the test. Neither the teachers nor the 
students knew the purpose of the test. 


Serial Learning 


To permit a detailed comparison between 
the position of errors in spelling and in serial 
rote learning, serial learning data were ob- 
tained, Sixty subjects (college students 
learned by the anticipation method a nine- 
item serial list composed of nine colored 
geometric forms: triangles, squares, and 
circles, colored red, blue, and yellow. The 
stimuli were projected automatically one at 
a time at a 3-second rate with a 6-second 
intertrial interval. (A detailed description of 
the apparatus is given elsewhere—Jensen, 
Collins, & Vreeland, 1962.) Never were fig- 
ures of the same shape or color adjacent in 
the series. A green light of 3-seconds dura- 
tion served as the signal for the subject to 
anticipate the first item in the series. Sub- 
jects responded by saying “red triangle, blue 
square,” etc. All subjects learned to a cri- 
terion of one perfect trial. different 
orders of stimuli were used, with 30 subjects 


Two 
assigned to each order 


RESULTS 


All the serial-position curves are pre- 
sented as the percentage of total errors 
that occur at each serial position, so 
that the area under all the curves is the 
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same. This method of presentation was 
adopted since the interest here does not 
concern the absolute number of errors 
but only the relative shapes of the 
curves for serial learning errors and for 
spelling errors. For the color-form serial- 
learning data, the percentage of errors 
at each position was determined for 
each subject individually and then aver- 
aged over all subjects, in order to weight 
each subject equally in the group curve. 
A similar method was applied to the 
spelling data. Since some words were 
misspelled much more frequently than 
others, the percentage of errors at each 
position was determined for each word 
and these were averaged over all words 
so that every word is weighted equally 
in the final curve. 
Figure 1 shows the 
errors as a function of letter position for 
96 9-letter words and for the serial posi- 
tion of errors made by 60 subjects in 
learning the color-form series. Both 
curves evince the essential features of 
the serial-position effect, that is, a bowed 
curve with the greater proportion of 
errors occurring after the middle posi- 
tion. The intraclass correlation between 
the pairs of data points for the two 
curves may be regarded as an index of 
the degree of similarity in the shapes of 
the curves (Haggard, 1958). The intra- 
class r was .81 between the spelling and 
the color-form curves. When the curves 
produced by the two serial orders of the 
color-form test were intercorrelated, the 


distribution of 


r was .96, which may be regarded as an 
equivalent-forms reliability of the shape 
of the serial-position curve. 

The error curves for 7- and 11-letter 
words are shown in Figure 2. These 
curves closely resemble serial-position 
curves found in the literature on serial 
learning (Jensen, 1962a; McGeoch & 
Irion, 1952, p. 123) with the degree of 
skewness varying as a function of the 
number of items in the series. The peak 
of errors for a 7-item list is generally at 


20 


Cent Errors 


Mean Per 








| Po 
Fic. 1. Serial-position curves for errors in 
nine-item color-form serial learning (60 sub- 
jects) and for spelling errors in nine-letter 
words. (The latter is based on a total of 
3,864 spelling errors, i.e., incorrect letters, 
made on 96 words by 148 tenth graders 


Position 5, and for an 11-item list it is 

at Position 7. The spelling errors con- 

form perfectly to these expectations. 
Since the usual method of learning 


to spell words is not the anticipation 
method generally used in serial learning 
experiments, but involves simultaneous 
viewing of all the letters in the word, 
one might ask if learning the color-forms 
by this method also evinces the serial- 
If it did not, it would 


position effect 


etter Pos 


Fic. 2. Serial-position curves for errors in 
7- and I1-letter words. (The curve for 7- 
letter words is based on a total of 1,546 spell- 
ing errors made on 24 words by 150 eighth 
graders. The 11-letter curve is based on a 
total of 2,262 spelling errors made on 48 
words by 89 junior college freshmen.) 
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be doubtful that the resemblance of the 
error curves for spelling to those of serial 
learning was indicative of a common 
psychological process underlying both 
forms of learning. To investigate this 
question, an additional 30 subjects (col- 
lege students) were given a color-form 
test that involved simultaneous presenta- 
tion of all the items in the series. Nine 
colored plastic forms (red triangle, 
green square, blue triangle, yellow dia- 
mond, red circle, yellow triangle, blue 
circle, pink square, and green diamond) 
were placed in a row before the subject. 
After the subject had studied the order 
of the objects for 10 seconds, the objects 
were disarranged haphazardly and the 
subject was asked to rearrange them in 
the proper order. This procedure was 
repeated, always with the same serial 
arrangement of the objects, until the 
subject learned to reconstruct their order 
perfectly. There was a different order 
for each subject. 


Figure 3 shows the serial-position 
curve obtained by this method of simul- 


taneous presentation. It may be re- 
garded as a spatial, rather than the 
usual temporal, form of serial learning. 
The serial-position effect clearly is not 
limited to learning by the anticipation 
method (Jensen, 1962b). It should be 
noted that the curve in Figure 3 more 
closely resembles the curve for misspell- 
ings in words in Figure 1 than does the 
serial anticipation curve in Figure 1. In 
brief, the distribution of spelling errors 
more closely corresponds to the spatial 
serial-position effect than to the tempo- 
ral. This is what we would expect, since 
the letters of words are viewed simul- 
taneously as in the spatial serial learn- 
ing procedure. The intraclass correla- 
tion between the words curve in Figure 
1 and the curve in Figure 3 is .90, 
which indicates a degree of correspond- 
ence between these curves that is almost 
as high as the “reliability” of the curves 
would permit. 


JENSEN 


DISCUSSION 


A strictly linguistic, rather than psy- 
chological, interpretation of these find- 
ings is not readily apparent. The ob- 
vious linguistic explanation of these 
results is in terms of the relative easiness 
of spelling the common prefixes and 
suffixes in words, which would tend to 
make for fewer spelling errors at the 
beginnings and ends of words having 
these elements. This explanation is un- 
convincing for several reasons. If pre- 
fixes and suffixes, most of which are 
composed of two or three letters, are 
learned as units, and if these prefixes 
and suffixes account for the shape of 
the error curve, then we should expect 
quite a differently shaped curve than 
was actually found in the present study. 
Each of the first three and last three 
letter positions should have an almost 
uniformly small proportion of errors. 
Furthermore, the differences in the 
skewness of curves for words of different 
lengths would be difficult to explain 
by the prefix-suffix hypothesis. It is 
apparent that even if the first and last 
three positions of the error curves are 
ignored, the middle portion of the word 
still manifests the bowed serial-position 





20 


@ ~ a 


Meon Percent of Tota! Errors 
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2 3 4 5 6 7 8 9 
Serial Position 
Fic. 3. The serial-position curve for errors 
on a spatial serial learning task in which all 
nine items were presented simultaneously on 
each trial (30 subjects). 
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curve for 11-letter 
even without the 


effect. Note the 
words in Figure 2; 
first and last three positions the errors 
are distributed in a fashion typical of 
serial-position curves. Furthermore, only 
a small proportion of the words used in 
this study had either suffixes or prefixes. 
Using as a criterion a list of 46 common 
Greek and Latin prefixes (McCrimmon, 
1950, p. 186), it was found that only 
16% of the 9-letter words had prefixes. 
The mean number of errors on the first 
three positions for these words was 5.60; 
for nonprefix words the mean number 
of errors was 6.80. This small differ- 
ence, in addition to the fact that only 
16% of the words had prefixes, could 
not account for the markedly bow- 
shaped error curves found in the present 
study. 

It is not claimed that spelling errors 
are solely a function of serial position. 
Certainly some words are phonetically 
more difficult than others, and this fac- 
tor is undoubtedly a large source of 
spelling errors. Each word has its own 
peculiar difficulties, and it appears that 
these difficulties can occur at any letter 
position in the word. It is only when the 
errors that are peculiar to individual 
words are “averaged out” by combining 
the position errors of a large number of 
words, as was done in the present ex- 
periment, that the distribution of errors 
associated with the serial-position effect 
becomes evident, Thus, a phonetically 
difficult element is probably more or less 
difficult according to its position in the 
word, 

There is still another interesting point 
of correspondence between spelling and 
serial learning. A number of studies 
(reviewed by Horn, 1960, pp. 1346- 
1347) have found that emphasizing 
difficult parts of words, in presenting 
the words in a lesson, by underlining, 
writing the letters in capitals, using bold- 
faced type, etc., is of little or no value in 
increasing the probability that the word 


will be spelled correctly. Emphasis of 
particular letters in a word corresponds 
to the so-called von Restorff procedure 
in serial learning, in which an item in 
the series is perceptually emphasized in 
some manner. The emphasized item is 
usually learned more readily than would 
be a nonemphasized item in the same 
position, but the difficulty of learning 
the list as a whole remains the same as 
if no item had been emphasized (Jen- 
sen, 1962c; Newman & Saltz, 1958). 
When errors are reduced at one posi- 
tion, they are correspondingly increased 
at other positions, so that there is no 
overall advantage from emphasizing a 
particular element in the series. This 
seems to be true in spelling as well as in 
serial learning. 
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THE EFFECTS OF RESPONSE MODE AND RESPONSE 
DIFFICULTY ON PROGRAMED LEARNING' 


ROBERT A. GOLDBECK anv VINCENT N. CAMPBELL 
American Institute for Research 


2 experiments in junior high school classes compared overt, covert, and 
reading response modes. In Experiment I, 63 Ss completed a program 
of independent facts at 1 of 3 levels of difficulty. An analysis of 
covariance of test scores showed an interaction between response mode 
and difficulty (p < .05), the overt group performing below the other 
groups at the low difficulty level and above the other groups at the 
intermediate difficulty level. In Experiment II, 62 Ss completed a 
continuous discourse program on light. The 3 response modes plus a 


4th option mode were used in a simple randomized design. 


The 


reading group surpassed the other groups on both immediate (NS) 


and 10-week retention (p < .5) tests. 


In both experiments the 


reading groups learned most per time spent (p < .01). 


One of the major advantages claimed 
for automated teaching is that it pro- 
vokes the student to respond actively to 
the material to be learned. The char- 
acteristic of automated techniques that 
accomplishes this goal is the requirement 
for the student to make some response 
during learning. In this regard, it is 
typically intended to prepare the learn- 
ing material such that all students will 
be able to respond correctly nearly all 
of the time (Skinner, 1958). 

There has been some question of the 
simple rationale that the most effective 
learning will be produced by highly 
cued and prompted material requiring 
overt responses which will be predomi- 
nantly correct (Briggs, 1960; Goldbeck 
& Briggs, 1960; Pressey, 1960). Evans, 
Glaser, and Homme (1960) and Silver- 
man and Alter (1960) have obtained 
data which fail to show the superiority 

1 This investigation was conducted by the 
American Institute for Research under Con- 
tract Nonr-3077(00) with the Office of Naval 
Research, Leslie J. Briggs, principal investi- 
gator. The investigation of the option re- 
sponse mode was part of a research project 
supported by Grant 736103.00 of the United 
States Office of Education, Department of 
Health, Education, and Welfare. 


of using overt responses in a simple 
learning situation. Angell and Lums- 
daine (1960) have shown that prompt- 
ing on three out of four trials is superior 
to prompting on all trials, following 
Cook’s work (Cook, 1958: Cook & 
Kendler, 1956; Cook & Spitzer, 1960) 
showing that prompting on three out of 
four trials is superior to no prompting. 

The experiments to be reported here 
concern the effects of response mode on 
programed learning. The overt written 
response mode will be compared with 
several other variants. The option re- 
sponse mode is most clearly related to 
the overt response mode. In the option 
mode the subject may write his response 
in the blank if he is sure it is correct, or 
if not sure he may leave the blank empty 
and proceed directly to view the feed- 
back. A second variant is the covert 
response mode. Here the subject simply 
attempts to “think” of the correct re- 
sponse before viewing the feedback. 
The third variant is much like conven- 
tional reading in that the correct re- 
sponse is printed in the blank. No 
feedback as such is provided. 

In Experiment I the overt, covert, 
and reading response modes are exam- 





ResPONSE Mope In AUTOMATED INSTRUCTION 111 


ined using independent facts as the 
learning content. Since each fact can 
be learned independently of the others, 
this kind of material lends itself well to 
variation in response difficulty by vary- 
ing the amount of cueing and prompting 
used. Experiment I is designed to test 
the hypothesis that the efficacy of a re- 
sponse mode is dependent on the cueing 
characteristics that determine learning 
response difficulty. 

This interdependency would seem to 
be most likely for the overt response 
mode. For highly cued or prompted 
responses, there is little chance of an 
error response occurring. In fact, the 
overt response may be performed in 
quite a perfunctory manner, with little 
responding other than 
Such 


responses may be considered copied re- 


’ 


or no “active’ 
the motor response of writing. 
sponses rather than constructed re- 
sponses and may produce an “illusion 
of learning.” With responses that are 
not so heavily cued, it is to be expected 
that constructing a correct overt re- 
sponse will produce a more relevant 
type of active participation, and at the 
same time, more errors during learning. 
If the more relevant response pattern 
facilitates learning to a greater extent 
than error responses degrade learning, 
some intermediate amount of cueing or 
prompting should be superior to highly 
cued responses. As cues and prompts 
are withdrawn further, error responses 
the advantages of the 
learning 


may override 


active overt and 
should fall off. 
In Experiment II, the overt, option, 


covert, and reading response modes Are 


res} sonsSe 


examined using continuous discourse 


material in order to extend the basic 
response mode findings of Experiment I 
to a more generally used type of learn- 
ing material. The notion about response 
difficulty discussed above led us to con- 


struct a form of continuous discourse 


programing with greater response diffi- 
culty than is typical. The option re- 
sponse mode is intended to increase the 
time efficiency of overt responding and 
to mitigate any adverse effect of error 
responses by allowing the subject to 
omit especially difficult responses 


EXPERIMENT | 
Me thod 


Material. The learning material consisted 
of 35 independent factual items of informa- 
tion covering men of history, geography, etc. 
They were presented in booklet form, one 
fact to a page, and in the same sequence for 
all subjects. For each response mode (overt, 
covert, and reading) each fact was prepared 
in easy, intermediate, and difficult versions. 
The difficult frames (pages) provided mini- 
mum cueing. Additional cues and prompts 
were added to obtain the intermediate and 
easy level frames. 

Three versions of one fact are shown below 
as examples of the three difficulty levels: 


A major industrial center for the coun- 
try, is the second 
largest city in Mexico. (Difficult) 

The Spanish settlers in Mexico named 
this city after a province and its capital 
city in Spain. A major industrial center 
for the country, is the 
Intermedi- 


second largest city in Mexic 


ate 

Ihe ideal climate of Guadalajara has 
made it a popular resort area for Ameri- 
cans, The Spanish settlers in Mexico 
named this city after a province and its 
capital city in Spain. A major industrial 
center for the country, .... -_ 
is the second largest city in Mexico. (Easy 


In the overt and covert response booklets, 
each frame was followed by an answer page 
showing the correct word for the blank in 
that frame. In the reading response booklet, 
the critical word was printed in the context 
of the frame as in conventional reading mat- 
ter, with the word underlined. No answer 
page followed the reading response frame. 
Che three response mode conditions, each in 
combination with the three difficulty levels, 
resulted in nine different kinds of learning 
booklets 

The criterion of amount learned was a test 
consisting of one question for each of the 35 


facts. The correct answer to the question was 
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the response word from the corresponding 
learning frame. The question used for the 
frames given as an example above was: 
“What is the second largest city in Mexico?” 
Subjects. A total of 63 subjects from two 
seventh grade classes participated in the 
study. Both the classes were of above-average 
ability, as shown by mean grade level scores 
of 8.45 and 8.49 for the two classes on the 
California Reading Maturity Test. 
Procedure and design, The experiment 
was conducted concurrently in two separate 
classrooms. Subjects were assigned randomly 
among the nine experimental conditions, six 
subjects to each cell. The remaining nine 
subjects were assigned to the overt response 
mode conditions as part of a separate analy- 
sis of overt responses. Subjects worked 
through the learning booklet at their own 
speed and could not return to any page once 
having completed it. As soon as the subject 
had completed the learning booklet, the ex- 
perimenter collected it and recorded the time. 
The subject completed the test at his own 
speed and was allowed to return to any 
question which he could not answer. 


Results 


Learning responses. Error rate dur- 
ing learning differed sharply among 
difficulty levels in the overt response 
mode, as planned. Mean numbers of 
correct learning responses (where 35 
represents a perfect score) for the diffi- 
cult, intermediate, and easy learning 
material groups were 6.7, 14.6, and 27.7, 
respectively (F = 15.03, p < .01, df = 
2/24). 

Test scores, Scores on the California 
Reading Maturity Test were used in a 
covariance analysis of criterion test 
scores. The average within-group corre- 
lation between the reading scores and 
test scores was .55. Adjusted mean test 
scores are shown in Table 1. The only 
significant effect was the Response 
Mode X Difficulty Level interaction 
(F = 4.25, p < .01, df = 4/35). The 
essence of this interaction is that the 
overt response group is superior at the 
intermediate level of difficulty and in- 
ferior at the easy level. 


TABLE 1 


MEAN Test Scores ADJUSTED BY THE 
CALIFORNIA READING Maturity TEsT: 
EXPERIMENT I 


Difficulty level 


Response 


mode Intermediate Difficult 


Overt 
Covert 


20.3 
21.3 
22.8 


0.9 
Reading 20.6 21.5 


Note.—Estimated SD within cells 4.7 


At the low difficulty level, the overt 
response group significantly 
lower than means for either of the other 
Using the co- 


mean is 


response mode groups. 
variance data with df = 35 to test ad- 
justed mean differences (Lindquist, 
1953), ¢ = 2.71, p < .05 and t = 2.85, 
p < .01 for overt group differences 
from the covert group and reading 
group, respectively. Requiring con- 
structed responses for the learning ma- 
terial for which constructing a response 
was relatively easy resulted in signifi- 
cantly poorer learning as measured by 
an immediate retention test. 

As constructing a response became 
more difficult, at the intermediate diffi- 
culty level, the test performance of the 
overt response group improved dramati- 
cally and significantly (¢ = 4.45, p < 
01). At this level of difficulty, the 
overt response group performed signifi- 
cantly better than the covert group 
(t = 2.92, p < .01). The value of re- 
quiring a constructed response appar- 
ently depends on the conditions under 
which a response is made. 

Test response—learning response re- 
lationship. In the 
effect of correct versus incorrect learn- 


order to evaluate 
ing responses on test performance, an 
analysis was performed on the learning 
and test responses for all 27 subjects 
who were required to give overt learning 


responses. At each difficulty level, a 
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TABLE 2 


PERCENTAGE CorreECT OF Test RESPONSES 
PRECEDED BY CORRECT VERSUS INCORRECT 
LEARNING RESPONSES: EXPERIMENT I 


Difficulty level 


Test responses Easy Intermediate Difficult 


Preceded by 54 74 
correct learning 
responses 
Preceded by 36 
incorrect learning 
responses 


total of 315 learning and test response 
pairs are available. These pairs were 
dichotomized according to whether the 
learning response was correct or incor- 
The number of correct test re- 
learning 


rect, 
sponses following a 
response and the number following an 


correct 


incorrect learning response were deter- 
mined, and were expressed as percent- 
ages. These data are presented in Ta- 
ble 2. 

While correct learning responses re- 
sult in a higher percentage of correct 


test responses at each difficulty level, it 


is apparent that this effect was out- 
weighed by other factors affecting test 
performance. In fact, test performance 
of the intermediate level group follow- 
ing incorrect learning responses was 
higher than test performance of the easy 
level group following correct learning 
responses (see Table 2 
Learning The 
learning time data is shown in Table 3. 
Note that the learning time means for 
the overt group follow the same pattern 
as the adjusted test score means. As 


time. summary of 


might be expected, due to the necessity 
to think of a response and to write it 
out, the overt groups spent 
more time in learning than the other 
groups. It is interesting to note that 
the covert groups spent no more time 
on learning than the reading groups 


response 


TABLE 3 


MEAN LEARNING Times (IN MINUTES 
EXPERIMENT I 
Response Difficulty level 


Mode Intermediate Difficult 


Overt 22.3 ~ 31.0 
15.3 


16.0 


19.2 
14.3 


15.2 


Covert 


Reading 


Note.—Estimated SD within cells 4.9 


TABLE 4 


ANALYSIS OF VARIANCE SUMMARY FOR 
LEARNING Time Data 
EXPERIMENT I 


Source 


Response mode 
Difficulty level 
Class 

RxD 

RxC 

DxC 

RxD 

Within 


Table 4 presents the analysis of variance 
for these data, showing a significant 
Response Mode X Difficulty Level in- 
teraction. 

efficiency. efhi- 


ciency scores were obtained by dividing 


Learning Learning 
the test score for each subject by his 
learning time score, giving a measure 
of amount learned per unit of learning 
time. Mean efficiency scores are shown 
in Table 5. An analysis of variance in- 
among 
the three response modes (F 13.74, 
b < .01, df = 2/36), with the reading 
mode most efficient, the overt response 


mode least efficient, and the covert re- 


dicated a clear-cut distinction 


sponse mode falling between. 
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TABLE 5 


MEAN LEARNING ErFicrency Scores (TEST 
Score/LEARNING TIME EXPERIMENT I 


Difficulty level 


Response 


Mode 





Easy Intermediate Difficult 





1.12 
1.32 
1.60 


0.67 0.82 
1.19 1.15 
37 1.60 


Overt 
Covert 


Reading 1. 





~ Note.—Estimated SD within cells = 0.37. 


Test time. Inasmuch as no time lim- 
its were imposed on test performance, 
it was possible to determine whether the 
experimental conditions during learn- 
ing had any effect on the time required 
to complete the test. A 3 X 3 analysis 
of variance on test score times for the 
response mode and difficulty variables 
showed a significant difference only 
among response modes (F = 4.74, p < 
.05, df = 2/45). The means for the 
overt, covert, and reading response mode 
conditions were 10.4 minutes, 13.4 min- 
utes, and 13.6 minutes, respectively. 


EXPERIMENT II 


Method 


Material. The learning material consisted 
of 32 frames on the topic of light, based on 
the initial portion of an autoinstructional 
program developed by Klaus, Holmok, Lums- 
daine, Meyer, & Quisenberry (1960). The 
modified program, in which a frame had 
from one to four response terms, contained 
a total of 51 response terms. 

The learning booklets followed the same 
format as in Experiment I, except for the 
feedback terms, The correct words for the 
blanks were printed on the right side of the 
page, in line with the appropriate blank, and 
were covered by a slip of gray paper. This 
cover could be lifted by the subject to show 
the correct words. As in Experiment I, there 
were no feedback terms for the reading re- 
sponse group. 

The criterion test of amount learned was 
a mixture of recall and recognition items. 
Although there was not a one-to-one corre- 
spondence between test items and’ learning 


frames, the test was designed to assign score 
values homogeneously among the areas of 
learning content. Some of the test answers 
had been a “response” term in the learning 
material while others had not. 

Subjects, Data were collected from a total 
of 62 subjects in two eighth grade classes. 
The ability level of the two classes was com- 
parable (¢ = .75) as measured by mean score 
on the reading portion of the School and 
College Ability Test (SCAT). Both classes 
were above average on this test. 

Procedure and design. The experiment 
was conducted concurrently in the two sep- 
arate Subjects were assigned 
randomly to the four experimental condi- 
tions: overt, option, covert, and reading 
response modes. The order of events for all 
subjects was (a) a practice program and test 
on mountaineering, (6) the experimental 
learning program on light, and (c) the cri- 
terion test. Each subject worked at his own 
speed. All subjects worked once through the 
learning program and were not allowed to 
turn back to earlier frames. Ten weeks after 
the learning and immediate test session the 
test was readministered to the two classes in 
long-term 


classrooms. 


order to provide a measure of 


retention. 


Results 
Learning Response Difficulty. Scor- 


ing the responses to the learning pro- 
gram on light gives some indication of 
the difficulty level for the program re- 
sponses, Subjects in the overt response 
group averaged 16% incorrect responses 
during learning. Subjects in the option 
group omitted an average of 12% of 
the responses and responded incorrectly 
to an average of 10% of the blanks. 
These error rates indicate that the pro- 
gram presented a relatively difficult level 
of responding as compared to typical 
continuous discourse programs. 
Amount Learned. Measures of 
mediate and long-term retention were 
test 


im- 


obtained from the scores on the 
given immediately after the program on 
light and again 10 weeks later. A co- 
variance analysis of the immediate test 
scores, using the reading scores from 


SCAT as the predictor variable, yielded 
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no significant differences among re- 
sponse modes (F = 2.19, p < .15, df = 
3/53) or between classes. The average 
within-cell correlation between the read- 
ing scores and test scores was .54. 

A corresponding analysis of covari- 
ance for the 10-week retention test data 
yielded significant response mode differ- 
ences (F = 3.39, p < .05). Although 
there was an appreciable decrease in 
test scores for all groups, the relative 
standing of the four response mode 
means remained the same after the 10- 
week period, as shown in Table 6. The 
only notable change in the magnitude 
of the differences was the slightly larger 
superiority of the reading 
group on the 10-week retention test. 

Learning time. The mean learning 
times for the four response mode groups, 
shown in Table 7, differed significantly 
(F = 26.08, p < .01). These means 
show a decreasing trend corresponding 
to decreased explicit requirement to 
respond actively. Using the analysis of 
variance error term with df 94 for ¢ 
tests between means, it was found that 
both writing conditions (overt and op- 
tion response modes) had significantly 
higher learning time means (p < .01 
than both “thinking” conditions (covert 
It is ap- 


response 


and reading response modes) . 
parent that the outstanding factor in 


TABLE 6 


IMMEDIATE AND 10-WeeEK RETENTION TEST 
Scores Apjustep By SCAT Reapinco 
Test: ExpermMment II 


Mean test score 
Response — 


10-wee 
mode 10-week 


Immediate retention 


Overt 
Option 
Covert 
Reading 


Estimated SD 
within cells 


TABLE 7 


MEAN LEARNING Times (IN MINUTES) AND 
SCAT Apjustep Erricirency Scores: 
EXPERIMENT II 


Mean 
Response learning 
mode time 


Mean 
efficiency 
adjusted ) 


1.40 
1.38 
1.95 
2.81 


Overt 20.7 
Option 

Covert 

Reading 


Estimated SD 
within cells 


0.71 


producing a learning time difference 
was the requirement to produce an overt 
response. There was also a significant 
learning time difference between classes 
F = 6.58, p < .05). The class with 
the larger mean learning time had the 
higher mean test score. 
Learning efficiency. The 
amount learned per unit of learning 
time was taken as an index of learning 
efficiency, as in Experiment I. A covari- 
ance analysis of efficiency scores pro- 
duced a highly significant value of F 


ratio of 


for the response mode variable (F = 
13.39, p < .01). The average within- 
group correlation between reading scores 
and efficiency scores was .45. Adjusted 
mean scores for response mode groups 
are shown in Table 7. The relative effi- 
ciency of response modes closely paral- 
leled Experiment I. 

Test time. It is appropriate to con- 
sider whether the experimental condi- 
tions had any effect on the time taken 
For 


by subjects to complete the test 


the immediate test, there was a signifi- 
cant difference among response modes 
F 4.18, p < .05). The mean test 
times for the overt, option, covert, and 
reading response modes were 10.3, 12.0, 
13.4, and 14.3 minutes, respectively. 
The order of these test times is exactly 
inverse to the order of learning times. 
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On the 10-week retention test, however, 
there were negligible time differences 
among response modes, The conditions 
of administration differed for the imme- 
diate and the 10-week retention test in 
that each subject began the immediate 
test as soon as he finished the learning 
program, while all subjects began the 
10-week retention test at the same time. 
Since a reliable test time difference 
among response modes was found only 
for the immediate test, it is likely that 
the difference and the same finding in 
Experiment I is attributable to the time 
pressures associated with test admin- 
istration. Subjects may have become 
aware that they were ahead of or be- 
hind the other members of the class as 
a result of learning time differences asso- 
ciated with response mode conditions, 
and some subjects may have compen- 
sated for this time difference by time 
spent on the immediate test. Apparently 
the group differences in time spent on 
the test had 
immediate test score, inasmuch as an 


no measurable effect on 


equivalent ordering of scores was ob- 
tained on the 10-week retention test for 
which there were no appreciable time 


differences. 
DISCUSSION 


The most general finding of Experi- 
ments I and II was the relatively low 
efficiency of the overt response condi- 
tion and the relatively high efficiency of 
the reading condition for both factual 
and continuous discourse learning. The 
question of why the reading mode has 
been found to be more efficient can be 
answered in general by considering that 
the set for an effective pattern of learn- 
ing can be established by presenting 
learning material in short logical steps 
(frames) that are interestingly written 
with an orientation for student respond- 
ing. The further question of whether 
this mode of autoinstructional program- 


ing can maintain its high level of effi- 
ciency over long periods of time needs 
further investigation. Techniques used 
in the present study, such as underlin- 
ing important words and promising a 
test at the end of a learning session may 
be sufficient to maintain learning effi- 
ciency, Other techniques, such as posing 
questions that will be answered by a 
block of subsequent frames and varying 
the format of the may be 
useful in maintaining efficiency. 

In nearly every comparison on the 
amount learned measure, the perform- 
ance of the overt response groups failed 
to exceed that of the reading groups. 
When learning responses were at the 
easiest level for the factual material, the 
inferiority of the overt response condi- 
tion was most marked. The only evi- 
greater amount learned 
condition was found 


program, 


dence for a 
under the overt 
with intermediate response difficulty in 
Experiment I. The exceptionally long 
learning time taken by this high per- 
forming overt response group is con- 
sistent with the hypothesis presented 
earlier concerning the active participa- 
tion that overt responding may stimu- 
late at an intermediate level of response 
difficulty. Since about the same number 
of words were written at all difficulty 
levels, the additional learning time re- 
quired by the overt group at the inter- 
mediate difficulty level cannot be at- 
tributed to writing. The 
active participation evoked in this group 
might be described as searching or 


time spent 


thought formulation. 

The results obtained for independent 
frames showed that the value of a cor- 
rect learning response is not so great as 
might be expected. The higher per- 
centage of correct test responses asso- 
ciated with correct learning responses 
that is shown in Table 2 of the results 
can be attributed in part to the likeli- 
hood that some of the facts to be learned 
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were already known by some of the 


students. This likelihood is evidenced 


by the finding that the percentage of 


correct test responses for correct learn- 
ing responses increased markedly as the 
amount of cueing was reduced. There 
also are differences in the inherent diffi- 
culty of the facts, increasing the likeli- 
that 
will be associated with correct test re- 


hood correct learning responses 
sponses. 

The most relevant item among the 
data for incorrect learning responses is 
the relatively high percentage of correct 
test responses for incorrect learning re- 
sponses at the intermediate difficulty 
level. This is interpreted as indicating 
that cueing a response somewhat below 
threshold may be an effective learning 
condition if the errors thus elicited are 
followed immediately by corrective feed- 
back. The failure of the option group 
group in either 
amount learned or speed of completion 
of the task (Experiment II) also sug- 
gests that avoidance of errors during 


to exceed the overt 


learning is not of critical importance 
The finding that the overt groups 
took much more time to learn nearly 
as much as the reading groups, might 
suggest that the 
learned the material and could be ex- 
pected to retain it longer than the read- 


overt groups over- 


ing groups. The results on the 10-week 
retention test in Experiment II do not 
support this expectation. The superior- 
ity of the reading group was, in fact, 
slightly greater after the 10-week period. 

It should be noted that the use of the 
test score and learning time ratio as an 
index of learning efficiency is not meant 
to imply a linear relationship between 
learning performance and time, Since, 
in fact, rate of learning tends to be 
higher early in learning, we might ex- 
pect learning conditions with short 
learning times to give relatively high 
indices of efficiency. Unless one condi- 


tion is superior to another in both 
amount learned and learning time, cau- 
tion should be employed in generaliz- 
ing relative efficiency to different learn- 
ing times. In the studies reported, we 
do not have adequate data for evalu- 
ating the increased level of learning 
which might be achieved during the 
additional time made available by more 
“efficient” learning conditions, 

A furthe 
learning efficiency is that a more precise 


consideration regarding 


measure of efficiency is obtained if 
amount learned is measured from a base 
determined by a chance test score or 
test scores obtained from a “no-learn- 
Such a refinement in the 
present studies would tend to magnify 


ing” group. 
slightly the obtained superiority of the 
reading group in learning efficiency. 
Finally, it should be recognized, in gen- 
eral, that time is not the only basis for 
Such 
cost of learning 


determining efficiency. items as 


amount or material, 
energy expended, and motivation loss 
are possible alternative considerations 
that may assume considerable impor- 
tance in practice. 


REFERENCES 
ANGELL, D., & Lumsparne, A. A. Stimulus 
presentation factors relevant to automated 
teaching. Paper read at Psychonomic So- 
ciety, Chicago, September 1960, 


Briccs, L. J. 
education 
Pittsburgh : 
searc h, 1960. 


Cook, J. O. Supplementary report: Proc- 
esses underlying learning a single paired- 
associate item. J. exp. Psychol., 1958, 56, 
$55. 


A survey of cueing methods in 
and in automated programs. 


American Institute for Re- 


Cook, J. O., & Kenprer, T. S. A theoreti- 
cal model to explain some paired-associate 
learning data. In G. Finch & F. Cameron 

Eds.), Symposium on Air Force human 

engineering, personnel, and training re- 
earch, Washington, D. C.: National 
Academy of Sciences-National Research 
Council, 1956 





118 Rospert A. GOLDBECK AND VINCENT N. CAMPBELL 


Cook, J. O., & Sprrzer, M. E. Supplemen- 
tary report: Prompting versus confirma- 
tion in paired-associate learning. J. exp. 
Psychol., 1960, 59, 275-276. 

Evans, J. L., Guaser, R., & Homme, L. E. 
A preliminary investigation of variation in 
the properties of verbal learning sequences 
of the “teaching machine” type. In A, A. 
Lumsdaine & R. Glaser (Eds.), Teaching 
machines and programmed learning: A 
source book, Washington, D. C.: National 
Education Association, 1960. Pp. 446-451. 


Gorpseck, R. A., & Briccs, L. J. An analy- 
sis of response mode and feedback factors 
in automated instruction. (Technical Re- 
port No. 2) Santa Barbara: American 
Institute for Research, 1960. 


Kuiaus, D. J., Hotmok, E. S., Lumspaine, 
, Meyer, D. S., & QuiseNnBerry, K. 
S. High school physics. Vol. 2. Reflection 


and refraction, Pittsburgh: American In- 
stitute for Research, 1960. 

LinpguisT, E. F, Design and analysis of ex- 
periments in psychology and education. 
New York: Houghton Mifflin, 1953. 


Pressey, S. L. Some perspectives and major 
problems regarding “teaching machines.” 
In A. A, Lumsdaine & R. Glaser (Eds.), 
Teaching machines and programmed learn- 
ing: A source book. Washington, D. C.: 
National Education Association, 1960, Pp. 
497-505. 


Sr_veRMAN, R. E., & ALTER, M. Note on the 
response in teaching machine programs. 
Psychol. Rep., 1960, 7, 496. 


SKINNER, B. F. Teaching machines. Science, 
1958, 128, 969-977. 


Received June 12, 1961) 





Journal of Educational Psychology 
1962, Vol. 53, No. 3, 119-126 


. 
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University of 


Massachusetts 


and Clarke School for the Deaf, Northampton, Massachusetts 


The purpose was to investigate concept attainment by deaf and 


hearing adolescents. 


There were 


30 deaf Ss matched first with 30 


hearing Ss on sex, age, and IQ and then with 30 hearing Ss on sex, 


school achievement, and IQ 


concept problems. The conclusions were 


All Ss were tested individually on 6 


a) deaf Ss manifest capa- 


bility equal to hearing Ss in tying together sensory impressions and in 
making use of these integrated sensory impressions to classify new 


objects, 
step in problem solving, (c 


b) deaf Ss show greater cautiousness in taking the initial 
deaf and hearing Ss proceed in concept 


attainment tasks by the use of similar strategies 


The purpose of this study is to investi- 
gate concept attainment by deaf and 
hearing adolescents in problem solving 
situations. Deafness renders difficult the 
coordination of speech categories to the 
nonlinguistic categories of the world 
Brown, 1956). But where nonlinguistic 
categories have defining attributes based 
more upon sensory properties than upon 
socially prescribed properties, it would 
appear that the deaf child would not 
be inferior to the hearing child. If this 
latter notion is confirmed, then there 
would be reason to believe that deafness 


may not radically alter the nature of 
conceptual ability in deaf children, but 
would simply prevent the acquisition of 


categories, or make less available those 
categories based upon socialization. 
Some studies indicate that the deaf 
subjects are less capable than hearing 
subjects of abstracting the essential at- 
tributes of objects or events that belong 
to a class and of using such attributes 
for identifying the new instances of this 
class without further learning (Hughes, 


1959; McAndrew, 1948; Myklebust & 


1This study was supported, in part, by a 
research grant (M-4392) from the National 
Institutes of Health, United States Public 
Health Service. 


Brutten, 1953; Oleron, 1953). A differ- 
ent view of the conceptual capacities of 
the deaf is stated by Kates, Kates, 
Michael, and Walsh (1961) and by 
Rosenstein (1959). For example, Kates 
et al. (1961) demonstrated that deaf 
subjects performed as well as hearing 
subjects, matched on age and IQ, when 
their categorizations were judged inde- 
pendently of their verbalizations. 

The problem of this study was to de- 
termine how deafness affects first, the 
ability to attain conjunctive concepts 
where all instances share common attri- 
butes) and second, the processes mani- 
fested in concept attainment (Bruner, 
1957; Bruner, Goodnow, & Austin, 1956; 
Werner, 1948). 

Specifically, the hypotheses of this 
study were: 

l. There 


concept attainment between deaf ado- 


will be no differences in 
lescent subjects and older hearing con- 
trol subjects matched on age and IQ 
the total number of choices be- 
fore solution, (6) the latency of the first 
choice in each conceptual problem, (c 
the total time taken to achieve solution, 
d) the number of guesses about the 
the number and 


in (a 


correct concept, (é) 
kind of changes made in attribute values 
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on the first choice, (f) the percentage 
of focusing changes made in the first 
four card choices, and (g) the compari- 
son of their respective performances on 
the orderly and random array of stimu- 
lus materials. 

2. Deaf adolescent subjects will be 
significantly superior in concept attain- 
ment to younger hearing control sub- 
jects matched on school achievement 
level and IQ in (a) the total number 
of choices before solution. Since the 
younger hearing subjects have also de- 
veloped formal thinking processes (In- 
helder & Piaget, 1958), it is hypothe- 
sized that there will be no differences in 
the underlying processes of concept at- 
tainment as follows: (b) the latency of 
the first choice in each conceptual prob- 
lem, (c) the total time taken to achieve 
solution, (d) the number of guesses 


about the correct concept, (¢) the num- 
ber and kind of changes made in attri- 
bute values on the first choice, (f) the 
percentage of focusing changes made in 


the first four card choices, and (g) the 
comparison of their respective perform- 
ances on the orderly and random array 
of stimulus materials. 


METHOD 


Subjects 


One group consisted of 30 deaf students 
(17 girls and 13 boys) at the Clarke School 
for the Deaf where the oral method of train- 
ing is employed. Each subject, profoundly 
deaf from birth or infancy, has no other 
physical defect, and is free from atypical 
personality characteristics. The deaf subjects 
have a mean IQ of 96.87 on the appropriate 
Wechsler intelligence scale, a mean age of 
16.4 years, and a mean school grade achieve- 
ment level on the Stanford Achievement Test 
equivalent to the 8.51 grade level. 

Our first control group (C,) consisted of 
30 hearing subjects (17 girls and 13 boys) 
from the senior high school. They have a 
mean IQ (Wechsler and Otis) of 97.63, a 
mean age at 16.55 years, and were progress- 
ing at an average pace in their studies. Our 


second control group (C:) was composed of 
30 hearings subjects (17 girls and 13 boys) 
from the junior high school. Their average 
IQ (Wechsler and Otis) is 97.50, their av- 
erage school achievement grade level (Stan- 
ford Achievement Test) is equivalent to the 
8.50 grade level, and their average age is 
13.6 years. All hearing subjects were free 
from overt personality and physical malad- 
justments 

There were no significant differences be- 
tween the deaf subjects and the junior high 
school students on school achievement grade 
level and on IQ, the two characteristics on 
which these subjects were matched. No sig- 
nificant differences were found between deaf 
subjects and the senior high school students 
on the two relevant characteristics of age 


and IQ. 


Concept Attainment Materials 


The concept attainment materials consisted 
of 81 cards measuring 3” X 5”, described by 
Bruner et al. (1956). On each card, there 
is a combination of four attributes comprising 
color, figures, number of figures, and borders. 
The color attribute has three values: red, 
black, and green. The figure attribute in- 
cludes cross, square, and circle. The number 
attribute includes one, two, and three. There 
are one, two, or three borders, the three dif- 
ferent values for the fourth attribute. Some 
of the cards are one black square with one 
border, three black squares with two borders, 
two green crosses with one border, three red 
circles with three borders. 

The 81 cards were displayed in two differ- 
ent conditions. The first condition was an 
orderly one where all the cards were grouped 
systematically with similar attributes and 
values placed in juxtaposition (orderly 
board). The second condition was a random 
one with the 81 cards assigned to positions by 
the use of random numbers (random board). 


Procedure 


All subjects were tested individually on the 
same problems which were administered in 
the same order. The deaf subjects were 
divided into two subgroups of 15 subjects 
each, matched on IQ, school achievement 
grade level, and age. The older hearing 
control group was also divided into two sub- 
groups of 15 subjects each, matched on age 
and IQ. The younger hearing control group 
was similarly separated into two subgroups 





Concert ATTAINMENT BY DEAF ADOLESCENTS 121 


matched on school achievement grade level 
and IQ. One subgroup of deaf subjects, 
older hearing subjects, and younger hearing 
subjects, respectively, solved the first three 
conceptual problems on the orderly board 
(first half) and three problems on the orderly 
board again (the second half). The other 
subgroup of deaf subjects, older hearing sub- 
jects, and younger hearing subjects, respec- 
tively, solved the first three conceptual prob- 
lems on the orderly board (first half) and 
three problems on the random board (second 
half). The nature of the task was completely 
described to the subjects who were to attain 
concepts that the experimenter had in mind; 
the concept was considered attained when 
the attribute values defining the concept were 
specified by the subjects. 

All the subjects completed three practice 
problems before attempting the six experi- 
mental problems. The experimental problems 
required the subjects to attain the following 
concepts: in the first half, green crosses, two 
squares, circles and one border; and in the 
second half, red figures, three black figures, 
two squares and two borders. The depend- 
ent variables were: 


1. The number of choices before attaining 
the concept. 

2. The latency of the first choice 

3. The total time taken for attaining the 
concept, 

4. The number of guesses made about the 
concept. 

5. The number and kind of changes made 
in attribute values on the first choice. This 
is a measure of strategy and evaluates first 
how many attribute values the subjects 
change on their first choice as contrasted 
with the focus card and second whether the 
more abstract attribute (number) or the less 
abstract attribute (figure) is changed on the 
first card. 

6. The percentage of focusing changes 
made in the first four choices. A focusing 
change is a change from the focus card in 
one value only of an untested attribute, thus 
evaluating the strategy pursued on the first 
four choices 


Statist al Pri ce dure 


Because we were not concerned with the 
overall differences among the three groups of 
subjects and because we were concerned with 
the differences between the deaf group and 
each matched hearing group separately, com- 


parisons were made of the deaf subjects with 
each one of the two hearing control groups. 
The analysis of variance technique was used 
for all dependent variables, 


RESULTS 


Deaf and Older Hearing Subjects on 
Choices, Latency, Total Time, Guesses, 
Attributes Changed, and Focusing 
Changes. The results indicated there 
were no significant differences between 
the deaf subjects and the older hearing 
subjects on the number of choices neces- 
sary to solve the six conceptual problems 

Table 1). The older hearing group 
showed a significantly (at .025 level 
smaller latency for their first choices 
than the deaf subjects (Table 1). In 
the total time taken to solve the con- 
ceptual problems, there was no signifi- 
cant difference between the deaf and 
older hearing groups. The older hear- 
ing subjects offered a significantly (at 
.025 level) greater number of incorrect 
guesses about the defining attributes be- 
fore solution than did the deaf subjects 

Table 1). 

There were no significant differences 
between the two groups on the number 
of changes and on the kinds of changes 
in attributes on the first card choice 
and on the percentage of focusing 
changes made in the first four chioces 

Table 2 

Deaf and Older Hearing Groups on 
the Orderly-Orderly and Orderly-Ran- 
dom Sequences, There were no signifi- 
cant differences between deaf and older 
hearing subjects with regard to the 
variables of card choices, latency, total 
time, guesses, number and kinds of at- 
tributes changed on first choice, and 


percentage of focusing changes on first 


four choices when their respective per- 
formances were compared on the or- 
derly-orderly and the orderly-random 
sequences (Tables 1 and 2 Finally, 
the differences between the respective 
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TABLE 1 


Means, SDs anv F Tests ror Dear AND Two Controt Groups ON 
Cuoices, Latency, Time, AND GUESSES" 


Total Time 
Choices (in seconds) (in seconds/10 


Group - ——— — |—— 


| 


Mean | AY Mean ‘ Mean 





All problems | 
Both sequences (O-O 
O-R) 
C, (older hearing | 
group ) ) 39.4" 98 | 118. 
D (deaf group) 49.3 20.6 | 128. 
C; (younger hearing | 
group ) 
Orderly-orderly se- 
quence (O-O) 


103.2 


C, 
D 
C: 
Orderly-random se- 
quence (O-R) 
C, 
D 
C; 
First half 
Orderly first half of 
O-O sequence 
Cc 
D 
C; 
Orderly first half of 
O-R sequence 


i 


Second half 
Orderly second half of 
O-O sequence 
C, 
D 
C; 
Random second half of 


O-R sequence 
29.6 9.6 20. 10.2 49.5 


5 
30.5 12.6 | 27.: 17.3 56.1 
40.4 / . 116 | 56.9 


C 
D 
C 





*All the F tests for the comparisons between the corresponding means of the deaf group and the older 
hearing group and between the corresponding means of the deaf group and the younger hearing group were 





not significant unless otherwise indicated. 
>F test was significant for the comparison between th 
©F test showed a trend at .10 level for the comparison between corresponding means of the deaf group and 


means of the deaf group and older hearing group 


the younger hearing group 
4F test was significant for the comparison between the means of the deaf group and the younger hearing 


group. 
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TABLE 2 


Means, SDs, anv F Tests ror Dear AND Two Controt Groups On First 


Cuoice CHANGES AND Focustnc CHANGES* 


Number oi at- 


Cumulative per 
Number of num- Number of figure | centage of focus 
tribute changes ber changes on changes on first ing changes in 
ad on first choice first choice choice first four choices 
Group 
Me SD 


Mean SD | Mean 
All problems 





SD 


Mean 
Both sequences (O-O 
O-R) 
C, (older hearing 
group) 
D (deaf group 
C: (younger hearing 
group) 


SD 


Orderly-orderly sequence 
(O-O 
Ci 
D 
C; 


Orderly-random se- 
quence (O-R 
Cc, 
D 
C; 
First half 


Orderly first half of O-O 
sequence 
C; 
D 
C; 


Orderly first half of O-R 
sequence 
C: 
D 
C: 
Second half 


Orderly second half of 
O-O sequence 
C; 
D 
C; 


Random second half of 


O-R sequence 
Ci ! 
D + 
C; 4. 


6 l 


, 1.0 
1.4 } 


: l d 
3 5 r l 6 
1.0 8 6 1.0 ; l » 
*All the F tests for the comparisons between the corresponding means of the deaf group and the older 
hearing group and between the corresponding means of the deaf group and the younger hearing group were not 
significant 
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performances of the deaf and the older 
hearing subjects on the first half and 
on the second half of the orderly-orderly 
sequence, and on the first half and the 
second half of the orderly-random se- 
quence were not significantly different 
for card choices, latency, total time, 
guesses, number and kind of attributes 
changed on the first choice, and per- 
centage of focusing changes on the first 
four choices (Tables 1 and 2). 

Deaf and Younger Hearing Subjects 
on Choices, Latency, Total Time, 
Guesses, Attributes Changed, and Fo- 
cusing Changes. The results indicated 
that the deaf subjects and the younger 
hearing subjects did not differ signifi- 
cantly on the number of choices re- 
quired before solving the six conceptual 
problems. However, there was a trend 
since the deaf subjects required a 
smaller number of choices than the 
younger hearing group with this differ- 
ence reaching the .10 level of signifi- 
cance (Table 1). 


The younger hearing subjects took 
significantly (at .01 level) less time to 


make their first choices in the con- 
ceptual problems than the deaf subjects 
(Table 1). The deaf and younger hear- 
ing subjects were not significantly dif- 
ferentiated on total time, number of 
guesses, number and kind of attributes 
changed on first choice, and on the per- 
centage of focusing changes on the first 
four choices (Tables 1 and 2). 

Deaf and Younger Hearing Groups 
on the Orderly-Orderly and Orderly- 
Random Sequences. There were no sig- 
nificant differences between deaf and 
younger hearing subjects with regard to 
card choices, latency, total time, guesses, 
number and kind of attributes changed 
on first choice, and percentage of focus- 
ing changes on the first four choices 
when their respective performance were 
compared on the two different se- 
quences, the orderly-orderly sequence 


and the orderly-random sequence. Fi- 
nally, there was no significant finding 
that the deaf and the younger hearing 
subjects were different when their re- 
spective performances on the first half 
of each sequence were contrasted with 
their performances on the second half 
of each sequence for the variables of 
card choices, latency, total time, guesses, 
number and kind of attributes changed 
on first choice, and percentage of focus- 
ing changes on the first four choices 
(Tables 1 and 2). 


DISCUSSION 


The results confirm Part a of the first 
hypothesis; the deaf and the older hear- 
ing subjects are equally capable of mak- 
ing use of discriminable attributes to 
distinguish exemplars of a concept from 
nonexemplars. Here, the deaf manifest 
capability equal to the hearing, in in- 
tegrating the sensory impressions that 
comprise the desired concept and in 
making use of these integrated sensory 
impressions to classify new objects. It 
can also be inferred that language 
achievement (the older hearing subjects 
are assumed to be more proficient in 
language than the deaf subjects) does 
not have important significance for effi- 
ciency in attaining concepts based upon 
sensory similarity rather than upon simi- 
larity dictated by culture or convention. 

That part of the first hypothesis in- 
volving the latency of the first choice in 
each conceptual problem was not sup- 
ported. Rather than no differences be- 
ing revealed, it appeared that the deaf 
took significantly more time in making 
this first choice than the hearing. It may 
be that this greater latency in respond- 
ing to problems may be a function of 
the sensory deprivation that the deaf 
suffer, which finds some expression in 
the greater reluctance to engage in 
initial exploratory behavior. But that 
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part of the first hypothesis dealing with 
time was substantiated, since both hear- 
ing and deaf subjects required about 
the same time to solve the problems. 
The older hearing group offered a 
significantly greater number of guesses 
than the deaf thus invalidating that 
part of the first hypothesis dealing with 


guesses. Explanations for this signifi- 


cant difference may be the recognition 
by the deaf of their relative inability in 
speech communication and their con- 


sequent reluctance to offer spoken 
guesses, and their possibly greater cau- 
tiousness and fear of failure prior to 
embarking upon a course of action. 

The parts of the first hypothesis deal- 
ing with the number and kind of 
changes made in attribute values on the 
first choice and with the percentage of 
focusing changes made in the first four 
card choices were confirmed. No sig- 
nificant differences in these two meas- 
ures yield support to the notion that 
deaf and older hearing groups pursue 
similar strategies in solving conceptual 
problems. 

The type of changes on the first 
choice—whether the figure or the num- 
ber is changed—would seem to be a 
measure of ability to deal with either a 
relatively less abstract attribute, the fig- 
ure, and/or a relatively more abstract 
attribute, the number. When the sub- 
ject changed the number, this signified 
that he was able to detach the number 
from the focus card, an attribute more 
difficult to abstract (Heidbreder, 1946). 
On the other hand, the change of the 
figure signified that a more concrete 
attribute was abstracted for change. 
The deaf and the older hearing subjects 
showed no differences in the kinds of 
changes made on the first card, It 
would appear that the deaf are no more 
attracted to the more concrete attributes 
than the hearing, and are able to ma- 
nipulate the more abstract attributes. 


Finally, the presumably greater diffi- 
culty in concept attainment when the 
cards are in random, rather than in 
orderly, array (found by Bruner et al., 
1956) did not distinguish the deaf and 
the older hearing groups on any vari- 
able. In the Bruner et al. (1956) pro- 
cedure, one group of subjects was intro- 
duced immediately to the random board, 
and there was a decrement in their 
performance as compared to the per- 
formance of a matched group on the 
orderly board. However, in our experi- 
ment, there were no differences appar- 
ently because performance on the or- 
derly board preceded the performances 
on both orderly and random boards and 
resulted in modes of attaining concepts 
that carried over with equal proficiency 
to these boards. 

Except for two parts, the second hy- 
pothesis was confirmed, indicating that 
the deaf and younger hearing groups 
were not different on total time, number 
and kind of changes made in attribute 
values on the first choice, percentage of 
focusing changes made in the first four 
card choices, and comparison of their 
respective performances on the orderly 
and randomly arrayed stimulus ma- 
terials. Hence, with regard to strategy 
and to the effect of increasing difficulty 
in stimulus materials, both deaf and 
younger hearing groups manifest about 
similar performance. 

The part of the second hypothesis 
dealing with the superiority of the deaf 
in number of choices required for solu- 
tion over the younger hearing was not 
substantiated. Nevertheless, the trend 
was in the predicted direction (at the 
.10 level) with the deaf requiring a 
smaller number of choices than the 
younger hearing group. 

The deaf group have a longer latency 
on their first choices than the younger 
hearing group, and thus that part of 
the second hypothesis was not supported. 
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It should be recalled that the deaf have 
significantly longer latencies than the 
older hearing group. These two find- 
ings point to the condition of deafness 
or to the training given to the deaf 
group as possibly inhibiting initial ex- 
ploratory behavior (Bruner, 1961). 

No significant differences between the 
deaf and the younger hearing group in 
number and kind of changes on first 
choice and in percentage of focusing 
changes on first four choices point to a 
similarity in their concept attainment 
strategies as predicted in our second 
hypothesis, At approximately 14 years 
of age, adolescents appear to develop 
formal operations that permit them to 
deal with problems using the same proc- 
esses as older adolescents of the same 
IQ (Inhelder & Piaget, 1958). The 
measures of number and kind of attri- 
bute changes on first choice and of per- 
centage of focusing changes on first four 
choices showed significant variations (at 


.05 level) from the first half to the sec- 
ond half of the experiment in the direc- 
tion predicted by increased difficulty of 
task. Hence, failure to find differences 
between the deaf and the two hearing 
groups, respectively, is probably not due 
to the lack of sensitivity of these meas- 


ures, 
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A COMPARISON OF TWO RESPONSE MODES IN 
AN AUTOINSTRUCTIONAL PROGRAM WITH 
CHILDREN IN THE PRIMARY GRADES 


EVAN R. KEISLAR anp JOHN D. McNEIL 
University of California, Los Angeles 


To test the value of overt responding with young children, a total of 
198 primary grade children divided into 2 matched groups were indi- 
vidually taught, by a teaching machine, a 3-week unit in physical 
science. The overt response group was required to respond to each 
frame by selecting the correct answer before the program would 
advance. The nonovert response group merely observed the entire 
program as it was automatically presented. Both groups learned sig- 
nificantly more than an uninstructed control group, but the difference 
between nonovert and overt responders was not significant. 





A number of recent studies have failed 
to find any advantage to the procedure 
of having learners respond overtly to 
the frames in an autoinstructional pro- 


gram. Permitting learners to “think” 


or simply to read the frame with the 
correct answer inserted produced re- 
sults which were usually effective and 
sometimes even superior (Evans, Glaser, 


& Homme, 1960; Goldbeck, 1960; Roe, 
Massey, Weltman, & Leeds, 1960; Sil- 
berman, Melaragno, & Coulson, 1961; 
Silberman, Melaragno, Coulson, & 
Estevan, 1961; Silverman & Alter, 
1961; Zuckerman, Marshall, & Groes- 
berg, 1961). For example, Goldbeck 
found with junior high school students 
that subjects who read the frames, the 
Implicit Response Group, learned more, 
with easy items at least, than those who 
were required to write these responses, 
the Overt Response Group. When the 
time required to learn was taken into ac- 
count, this superiority was even greater. 
These studies, however, used fairly ma- 
ture students as subjects, individuals 
who had probably already learned how 
to learn. 

Using young children as subjects with 
a program which called for a choice be- 
tween two responses on each frame, 
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Suppes and Ginsberg (1960) obtained 
different results. They found that 5- 
and 6-year-old children who were re- 
quired to correct overtly their errors 
learned more than children who were 
merely informed whether they had re- 
sponded correctly or not. 

It should be noted that most investi- 
gations with both older and younger 
subjects measured effects from relatively 
short periods of instruction, usually a 
single session. However, Holland 
(1960), using a semester-long program 
with college students found that overt 
responding to incomplete statements was 
more effective than simply reading the 
completed statements. It is important 
to know the effect of length of program 
and the conditions, if any, which make 
overt responding of young children de- 
sirable. 

It was the purpose of this study to 
test the value of overt responding with 
young children using a program which 
required approximately 3 weeks of in- 
struction. Specifically, it was hypothe- 
sized that children who are required to 
respond to each frame of a multiple- 
choice program with the correct an- 
swer will learn more than children who 
are not required to respond but who 
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observe the program and are also given 
at each step information as to the cor- 
rect answer. 

It was assumed that both those chil- 
dren who were required to respond 
overtly as well as those who were not so 
required would engage in implicit be- 
havior, The purpose of the study was 
not to assess the value of implicit be- 
havior; and hence, no attempt was made 
to inhibit covert responding of any 
child. The criterion posttest was com- 
posed largely of new questions which 
could only be answered if the subject 
had acquired appropriate implicit 
responses. A large portion of the multi- 
ple-choice frames in the program, there- 
fore, also required this mediating be- 
havior. The overt response of pushing 
buttons was not the terminal behavior 
sought. However, the very fact that the 
Overt Response Group was required, in 
connection with each frame, to make a 
visible overt act—to be scored right or 
wrong—was judged to increase the 


probability of the subject’s producing 
the desired implicit behavior. Speaking 
popularly, the subjects with an invest- 
ment openly at stake should pay closer 
attention and hence, learn more than 
those who make no visible commitment. 


METHOD 


Subjects 


The subjects for this investigation consisted 
of 118 children from School A in Grades 1 
and 2, and 80 children in School B from 
Grades 1 and 3, At each school the children 
were divided by matched pairs into two 
groups, an Overt Response and a Nonovert 
Response Group. In School B, 57 additional 
uninstructed children were used as a control 
group, permitting an estimate of the effec- 
tiveness of the program. 


Procedure 


All of the subjects from both experimeental 
groups were taught individually by a device, 
the videosonic teaching machine (experimen- 


tal model) , which projected slides and played 
a commentary to the learner through ear- 
phones. On the response panel, three buttons 
were available for responding to questions, 
each of which required a selection from two 
or three alternatives. The program consisted 
of 432 multiple-choice frames dealing with 
physical science and requiring each child to 
spend approximately 20 minutes a day for 
12 days. The difficulty level of this program, 
which has been described elsewhere (Keislar 
& McNeil, 1961) was previously found to be 
approximately 14% (proportion of frames 
missed on first attempt) for a comparable 
sample of first-grade children. 

Children in the Overt Response Group 
were required to press the correct button, the 
one in front of the appropriate picture, be- 
fore the program would advance. When they 
pressed the correct button, a green light was 
always turned on in front of the right answer. 
If they selected the wrong picture, these chil- 
dren were required to press a special button 
to clear the keyboard so as to permit another 
attempt. 

Children in the Nonovert Response Group 
were individually instructed by the same de- 
vice, but these children were not required to 
press buttons or make any overt response. 
To these “passive” observers the same pro- 
gram was automatically presented. A 5- 
second pause followed each question, approx- 
imating the time required by the other group 
to answer. At the end of this 5-second inter- 
val, a green light was turned on automati- 
cally in front of the right answer. 

In School A, 12 teaching machines were 
used, six for the Overt Response Group in 
one room, and six for the Nonovert Response 
Group in another room. In School B, 10 
machines were used in different parts of the 
same room, five for each of the two groups. 
Screens were placed between the machines to 
minimize for each subject distractions from 
other children. 

Although the children in the Nonovert 
Response Group were given no instructions 
to respond, on occasion a few children 
spontaneously pointed to a picture to indi- 
cate their answers to the questions they 
heard from the tape, but the amount of 
such overt responding decreased during the 
3-week period. 


Posttest 


In School A, the posttest consisted of, first, 
a standardized interview in which children 
were individually asked to explain or describe 
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certain physical phenomena, and, second, a 
multiple-choice group test which required 
each child to make a mark on his paper in a 
square corresponding to the correct picture 
projected on a screen. In School B, the ques- 
tions from the standardized interview were 
placed in multiple-choice form using as 
alternatives responses given earlier by chil- 
dren in the other school. In addition, the 
same multiple-choice test given in School A 
was used. In School B, therefore, both parts 
of the posttest consisted of multiple-choice 
items. 

In both schools each part of the posttest 
called for the application of principles taught. 
For example, children were asked to explain 
the formation of water on the outside of a 
glass of ice water, a task requiring a knowl- 
edge of molecular movement and condensa- 
tion. The subjects were given the first part 
of the posttest on the school day following 
the completion of the program; the second 
part of the posttest was administered on the 
next school day. 


RESULTS 


The reliability of the posttest in each 
school was computed by a split-half 


method, yielding, after correction, in- 
dices of .86 and .87 for the two schools 
based on all subjects in both experi- 


mental groups. These relatively high 
figures are, in part, attributable to the 
large spread of scores on the posttest. 
Such wide individual differences in 
scores are to be expected from a test of 


transfer; a more restricted range would 
have been expected if the test items had 
been identical with those in the pro- 
gram (McNeil & Keislar, 1961). 

Since the criterion test was slightly 
different in the two schools, the results 
are considered separately for each school, 
as shown in Table 1. It may be seen 
that the mean score for the Overt Re- 
sponse Group is only slightly higher in 
each school. In both instances, the dif- 
ference between the mean scores of the 
Overt and Nonovert Response Groups 
is practically zero, and is far from being 
statistically significant. 

In School B, a control group of 57 
youngsters from Grades 1 and 3 ap- 
peared each day to study a mathematics 
program. This group showed an aver- 
age score on the multiple-choice posttest 
of 36.6 with a standard deviation of 6.6. 
Since the differences between the ex- 
perimental groups and the control group 
are both highly significant, it is clear 
that the science program was effective 
in teaching. 


DISCUSSION 


Although the results of this investiga- 
tion are consistent with many studies 
involving older learners, they appear at 
variance with the findings of Suppes 


TABLE 1 


Posttrest Resutts In Two ScHOOLS FoR Overt RESPONSE AND 
NONOVERT ResPONSE Groups 











Group 





Overt Response 
Implicit Response 
B Overt Response 


B Implicit Response 








9.1 


*Scores for School A were obtained from a somewhat different test than the one used in School B 


> These standard errors are based on differences between pairs matched for grade, sex, intelligence, and, in 


addition, for School A only, pretest score 
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and Ginsberg (1960). There is, how- 
ever, an important difference between 
this latter study and the present one, In 
the Suppes and Ginsberg experiment, 
children who were not required to make 
an overt response of correction were 
merely told that their response was in- 
correct, These children did not do as 
well as those who were required to give 
the correct response. Where there are 
only two alternatives, it might be ex- 
pected that subjects, when told their 
answer is wrong, will “infer” that the 
other alternative is correct. Although 
this may be true for adults, children 
with limited experience may not do this. 
Young subjects may need to be told 
that the other alternative is correct. In 
the present investigation, children in the 
Nonovert Response Group were always 
given information for each frame telling 
which response was correct; they were, 
therefore, given in this way the same 
“information” as the children in the 
Overt Response Group. 

The failure to find a difference in 
favor of overt responding must be in- 
terpreted in the light of a number of 
considerations : 


1. The responses made by the Overt 
Response Group were not those meas- 
ured by the posttest. To evaluate the 
function of overt responses more effec- 
tively, it may be necessary to demon- 
strate that the behavior required on the 
posttest is directly related to the overt 
response demanded during the program. 
For example, if one were attempting to 
teach children to pronounce new words, 
it is conceivable that requiring the chil- 
dren to make an overt response, like 
saying words in completion of sentences, 
would be more effective than permitting 
the children merely to hear the com- 
pleted sentences spoken. 

2. The definition of an overt response 
adopted by this study is restricted to the 
required visible registration of the sub- 


jects’ answers. Assessment of the in- 
fluence of only this overt behavior was 
attempted. 

3. A report of “no difference” needs 
to be viewed in terms of the precision of 
the measures. The reliability of the 
criterion test seems to be satisfactory. 
The highly significant difference of 13 
points between the two experimental 
groups and the control group is evi- 
dence that a measurable product was 
effectively taught. The standard errors 
of the differences between the two ex- 
perimental groups (1.4 and 1.2 for 
Schools A and B, respectively) suggest 
that one may reject at the .01 level the 
hypothesis that the true difference is 
greater than four points in either school. 
If a four-point difference is judged to 
be unimportant, the hypothesis of a 
“negligible” difference may be accepted. 

4. The difficulty level of the program 
used may be too high to permit the 
Overt Response Group to show up well. 
Because this group was required to re- 
spond before being shown the right an- 
swer to each item, with difficult items 
they may have made more errors than 
those in the Nonovert Response Group. 
These latter children may well have 
waited until the green light showed the 
right answer before making an assuredly 
correct implicit response. Errors made 
by children during a program, even if 
corrected, may result in inferior learn- 
ing. 

5. For the learning of the new pat- 
terns of implicit responses called for in 
this study, th- equired overt response 
involving a reinforcement or “stake” 
may add nothing to the implicit be- 
havior of the learner. The program it- 
self, featuring color, sound, movement, 
and presentation of relevant cues, all 
factors involved in good programing, 
may be the critical factor. In short, the 
elicitation of appropriate implicit be- 
havior may have been accomplished 
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equally well, at least for this 3-week 
period of instruction, in both the Overt 
and Nonovert Response Groups because 
of the stimulus variables in the program. 
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THE PREDICTION OF THE ACADEMIC, ARTISTIC, 
SCIENTIFIC, AND SOCIAL ACHIEVEMENT OF 
UNDERGRADUATES OF SUPERIOR 
SCHOLASTIC APTITUDE’ 


JOHN L. HOLLAND anp ALEXANDER W. ASTIN 
National Merit Scholarship Corporation 


Undergraduate achievement in academic, artistic, scientific, and social 
areas was predicted from extensive assessments in the high school 
senior year for 4 independent samples of high aptitude students (N 
ranged from 354 to 953) over intervals from 1 to 4 years. The criteria 
of college achievement included grades, checklists of unusual accom- 
plishments in science and art (e.g., publications, winning public recog- 
nition through awards and prizes), and election to student offices. 
Predictors included: scholastic aptitude, personality, originality, 
interest, parental attitude variables, and background information. 
Correlational analyses indicated that achievers in each of the 4 areas 
resemble our stereotypes of the scientist, artist, leader, and academic 


achiever. 





Previous reports on the superior stu- 
dent have been concerned with the pre- 
diction of college freshmen grades (Hol- 
land, 1958, 1959, 1960), and with the 
correlates of scientific, artistic, and aca- 
demic achievement at the high school 
level (Holland, 1961). The present 
study, an extension of these four earlier 
studies, relates the student’s attributes 
as manifested in high school, to his 
achievement in college over 1-, 2-, 3-, 
and 4-year intervals. 

Four different kinds of talented per- 
formance served as the criteria of un- 
dergraduate achievement. They were: 
(a) creative performance in the arts 
(artistic achievement) , (b) creative per- 
formance in the sciences (scientific 


1This study is a part of the research pro- 
gram of the National Merit Scholarship Cor- 
poration and was supported by grants from 
the National Science Foundation and the 
Old Dominion Foundation. We are indebted 
to T. R. McConnell for providing some of 
the predictor data for the 1956 sample, and 
to the following persons for their assistance 
throughout this project: Virginia Chalmers, 
Laura Kent, and Elizabeth van Laer. 


achievement), (c) high freshman grades 
in college (scholastic achievement) , and 
(d) leadership (social achievement). 
Although scholastic achievement was 
reported on in earlier papers, it is dis- 
cussed again here for the purpose of 
comparing it with the three other kinds 
of achievement. 

The earlier studies presented exten- 
sive information about the student’s 
background and attributes. The pur- 
pose of the present paper is to organize 
this information so as to reveal what 
attributes are associated with each dif- 
ferent kind of achievement. It was as- 
sumed that the formulations derived 
from these correlational clusters would 
be useful in developing preliminary 
statements of a theory of achievement. 

Most of the previous studies of 
achievement at the undergraduate level 
have been limited to scholastic achieve- 
ment. However, it has been found that 
getting good grades in college has little 
connection with more remote and more 
socially relevant kinds of achievement; 
indeed, in some colleges, the higher the 
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student’s grades, the less likely it is that 
he is a person with creative potential 
(Holland, 1961). So it seems desirable 
to extend our criteria of talented per- 
formance. 


METHOD 
Student Samples 


The four student samples were obtained 
from the four studies reported earlier (Hol- 
land, 1958, 1959, 1960, 1961). Although 
these National Merit finalists were originally 
drawn at random from pools of the 5,000 to 
10,000 students who represented by state the 
top segments in aptitude of participants in 
the National Merit program (who numbered 
58,000 to 600,000 for the years 1956 and 
1959), the samples used in the present study 
cannot be called representative since cnly 
those students for whom we had complete 
predictive and criterion data were included. 
These students can be best characterized as 
students of extremely high scholastic apti- 
tude; the Ns, means, and SDs for the four 
samples are shown in Table 1. 

This summary suggests that the four sam- 
ples are comparable, at least with regard to 
aptitude, and that they do not differ much 
from the original samples from which they 
were drawn (see earlier reports). 


Predictors 


The predictors consisted of variables select- 
ed from the original assessments and ob- 
tained when the students were high school 
seniors. For the 1956 sample, these variables 
included the Originality, Complexity-Sim- 
plicity, Psychopathic Deviate, and Hypo- 
mania scales of the Omnibus Personality In- 
ventory (OPI), and the Aviator, Physicist, 
Social Science Teacher, Accountant, Sales 
Manager, Musician, Occupational Level, and 
Masculinity-Femininity scales of the Strong 
Vocational Interest Blank (SVIB). 

For the 1957 sample, the predictors in- 
cluded all the scales of the California Psycho- 
logical Inventory (CPI), and the Inventory 
of Beliefs (IB). 

For the 1958 sample, the predictors in- 
cluded the Cyclothymia (A), Dominance 
(E), Surgency (F), Autia (M), Radicalism 
(Q;), and Self-Sufficiency (Q.) scales of the 
Sixteen Personality Factor Questionnaire (16 
PF); the Physical Activity, Intellectuality, 
Responsibility, Conformity, Verbal Activity, 
Emotionality, Control, and Status scales of 
the Vocational Preference Inventory (VPI) ; 
and the Dominance, Dependency, Play, Per- 
sistence, Super-Ego and Tolerance for Ambi- 
guity scales of the National Merit Student 
Survey (NMSS). 

For the 1959 sample, the predictors in- 
cluded the Complexity-Simplicity, Independ- 


TABLE 1 


Scnotrastic Aptirupve (SAT) 


Sample 


Verbal 
Math 





Verbal 6 


Math 


Verbal 
Math 


658.63 
700.03 





* Boys, N = 369; girls, N 
> Boys, N = 681; girls, N 
© Boys, N = 294; girls, N 
“Boys, N = 282; girls, N 


691.40 
726.36 
73 

14.38 


80.91 


712.03 


Leve.s or STUDENT SAMPLES 


53.85 
60.00 673.37 
687.14 
660.14 


35 42.76 
68.04 


689.04 
650.96 


54.59 
68.60 


5 656.70 
3 638.25 


57.1 
60.7 
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ence of Judgment, and Originality scales of 
Barron’s Inventory of Personal Philosophy; 
the Mastery and Deferred Gratification scales 
of the NMSS;; the Seclusion of the Mother, 
Excluding Outside Influences, Avoidance of 
Communication, Ascendency of Mother, In- 
trusiveness, and Acceleration of Development 
scales of the Parental Attitude Research In- 
strument (PARI) ; the Initiative Scale of the 
Ghiselli Self-Description Inventory; father’s 
rank-ordering of nine traits for his children; 
and the Intellectuality, Responsibility, Emo- 
tionality, Control, and Status scales of the 
VPI. 

A number of other predictors were com- 
mon to two or more of the four student 
samples: the Mathematics and Verbal scales 
of the Scholastic Aptitude Test (SAT), high 
school grades (HSR), self and teacher rat- 
ings, the Creative Activities Score (CAS), 
and scores for creative performance in science 
and art during the high school years. 

In all, from 24 to 46 predictors were cor- 
related with the four criteria of undergrad- 
uate achievement for each student sample. 
Complete descriptions of these predictors are 
contained in the earlier reports. 


Criteria of Achievement 


A record of each student’s college perform- 
ance was obtained by polling the four sam- 
ples during the spring of 1960. At that time, 
the 1956 sample was at the end of its senior 
year in college (4-year interval between pre- 
dictors and criteria), the 1957 sample was at 
the end of its junior year (3-year interval), 
the 1958 sample was at the end of its sopho- 
more year (2-year interval), and the 1959 
sample was at the end of its freshman year 
(l-year interval). Students filled out the 
form by checking those achievements which 
applied to them. The following items from 
the form comprise the achievement scales: 


Leadership 


1. Elected to one or more student of- 
fices. Circle appropriate number: 1 2 3 
45 67 8 9 or more. 

2. Initiated a business enterprise of any 
kind. Explain. 

3. Initiated a student or social organi- 
zation of any kind. Explain. 

4. Received an award or special recog- 
nition for leadership of any kind, Explain. 


Scientific Achievement 


1. Received a research grant or Nation- 


al Science Foundation Fellowship. If 
grant, indicate nature, 

2. Received a prize or award for scien- 
tific paper or project. 

3. Gave an original paper at a scientific 
or professional meeting sponsored by a pro- 
fessional society or association. Name of 
society or association, 

4. Had scientific or scholarly paper pub- 
lished (or in press) in a scientific or pro- 
fessional journal. Give complete reference 
here if published. 

5. Invented a patentable device, If yes, 
indicate briefly its nature. 

6. Constructed an original piece of 
scientific apparatus. Describe briefly. 


Artistic Achievement 


1. Won one or more speech or debate 
contests, 

2. Had minor roles or leads in plays 
not produced by a college or university. 
Give name of organization producing play. 

3. Was regular performer on radio or 
TV program. 

4. Won prize or award in a musical 
competition as a performer. 

5. Composed music which has been 
given at least one public performance 

6. Arranged music for public perform- 
ance. 

7. Had one or more musical publica- 
tions. 

8. Performed with a professional or- 
chestra. 

9. Gave public recital (not collegiate): 

10. Received a literary award or prize 
for creative writing (poem, novel, play, 
etc.). 


Academic Achievement (grades in fresh- 
man year) equals the Honor Point Ratio 
(HPR) where A = 4,B = 3,C =2,D=1, 
and F = 0. Table 2 shows the intercorrela- 
tions of these criteria for the four student 
samples. The criteria appear to be relatively 
independent, except for the Artistic and 
Social Achievements, which are consistently 
positively related in all samples. 


RESULTS 


The correlations between the stu- 
dent’s attributes as manifested in high 
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rABLE 2 


INTERCORREL ATIONS OF THE Four CRITERIA or STUDENT ACHIEVEMENT IN 
CoLuece ror Four Groups or STUDENTS 


Academic 


Criterion 


1956 1957 1958 1959 


Academic 
Artistic 08 O8 Ol 
Social oe ee 


Scientific —01 13 00 





Artistic 


1956 1957 1958 1959 


Social Scientific 


1956 1957 1958 1959 


1956 1957 1958 1959 


—05 —06 —06 


—04 04-01-01 
36 07 04 04 00 


08 —05 00-14 03 


Note.—Correlations for boys are shown in upper right triangle of matrix for 1956, 1957, 1958, and 1959 
samples. Corresponding correlations for girls are shown in lower left triangle. 


school and his subsequent achievements 
in college are presented in four tables 
(Tables 3-6) .2 To make these extensive 
findings more comprehensible, person- 
ality summaries were prepared for each 
kind of achiever: the Social, Artistic, 
Scientific, and Scholastic Achiever, The 
variables which support each statement 
are given in the parentheses following 
the statement. Interpretative or specu- 
lative statements can be identified by 
their lack of variable numbers. Nearly 
all the relationships are predictive, i.e., 
from high school to college, although 
for some of the samples certain other 
personal variables such as self-ratings 
were collected at the same time as the 
criterion data; in the tables, concurrent 
relationships are indicated by a “c” 
after the correlation. 

For the most part, each descriptive 
summary is limited to findings which 
are either replicated (i.e., significant 
correlations are found between identical 
predictors and the criterion) in two or 
more samples, or quasi-replicated (i.e., 
significant correlations are found be- 

2The influence of aptitude was not par- 
tialed out since the SAT scores have only 
negligible relationships with achievement in 
these samples. Correlations range only from 
—.27 to .19, with 72% falling within + .10. 


tween similar predictors and the crite- 
rion) in two or more samples. For ex- 
ample, if two different dominance scales 
are found to be positively correlated 
with leadership, then we would say that 
the finding is quasi-replicated. Other 
quasi-replications are less obvious and 
depend more upon personal interpreta- 
tion. Using many similar rather than a 
few identical predictive variables is a 
good way to increase the generality of 
the findings. Both Underwood (1957, 
pp. 281-287) and Brunswik (1956) 
have discussed the possible advantages 
in such a procedure, 


Social Achiever 


Table 3 shows the correlations be- 
tween leadership in college and the stu- 
dent’s attributes as measured in his 
senior year of high school. 

Table 3 indicates that leadership in 
college is best predicted by the follow- 
ing high school achievements: Artistic 
Achievement (2), Number of Elective 
Offices (4), and Number of Honors 
Won, The inclusion of oral activities, 
such as winning debates and public 
speaking contests, on the high school 
Artistic Achievement checklist may ac- 
count for the predictive success of that 
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TABLE 3 


PREDICTION OF SocIAL ACHIEVEMENT (LEADERSHIP) OVER I1-, 2-, 3-, AND 4-YEAR PERIODS 


Variable 








Time interval 





1 year 


2 years 3 years 





Boys Girls 


Boys Girls 


Boys Girls 


4 years 


Boys Girls 





. Leadership 
. Artistic Achievement 


(HS) 


3. Self-Evaluation 
. Number of Elective 


Offices (HS) 
. Creative Activities (B) 
5. Sense of Destiny 
. Popularity 
. Self-Confidence 
. Aggressiveness 
. Drive to Achieve 
. Number of Honors 
Won in HS 
. Dependability 
. Q, Self-Sufficient 
. Breadth of Interest 
. Dominance 
. Sociability 
. Complexity-Simplicity 
Q, Radical 
. Self-Understanding 
. Creative Activities (A)* 
. Dominance 
. Expressiveness 


3. Scholarship 


. F Cheerful 

. Sociability 

. M Introverted 

. Flexibility 

. Accountant 

. Barron Originality 
. Cheerfulness 

. Super-Ego 

. Self-Acceptance 


3. Physicist 


. Originality 

. Music (Training Time) 

. Sales Manager 

. Responsibility 

. Dependable 

. Inventory of Beliefs 
(Chicago) 

. Communality 

. Perseverance 

. Achievement via 
Conformance 

Independence 
. Verbal Aptitude 
. Math Aptitude 


Self-Rating 


DRS 

Self-Rating 
Self-Rating 
Self-Rating 
Self-Rating 


Self-Rating 
16 PF 


CPI 
Self-Rating 
OPI 

16 PF 
Self—Rating 


NMSS 
Self-Rating 
Self-Rating 
16 PF 

CPI 

16PF 

CPI 

SVIB 

OPI 
Self-Rating 
NMSS 
CPI 

SVIB 
Self-Rating 


SVIB 
VPI 
Fathers Value 


CPI 
Self-Rating 


CPI 
Self-Rating 
SAT 


43c 


5lc 
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TABLE 3—Continued 


Variable Source 


Verbal Activity 
Status 
Social] Science Teacher 
A Sociable 
E Dominance 
Socialization 
Responsibility 
Capacity for Status 
Psychological 
Mindedness 
. Conformity 
Self-Control 
Intellectual Efficiency 
Achievement via 
Independence CPI 
. Good Impression 1P 
60. Sense of Well-Being CPI 


available or because it was not used for some other reason. 


are significant at or beyond the .05 level 


* This variable is an a priori scale of creative activities 


Time interval 





l year 2 years 3 years 4 years 





Boys Girls Boys Girls Boys Girls 


Boys Girls 


13 
1: 


08 a on 
08 ee 


Note.—Dashes indicate that a given variable was not correlated with the criterion either because it was not 


All correlations in this table and succeeding tables 


Subsequently the scores of high and low scorers on 


this scale were correlated with preference for a different set of activities, school preferences, etc., to produce 


another Creative Activities scale (Variable Number 5 

variable. Some of the findings with re- 
gard to personality scales seem to com- 
plement these results: Student leaders 
have a high Capacity for Status (53), 
aspire to occupations of high status 
47), and resemble men in, or express 
preferences for, occupations of a social 
and persuasive nature (Sales Manager, 
36; Verbal Activity, 46; Social Science 
Teacher, 48). In student 
leaders conceive of themselves as good 


addition, 


leaders: concurrent self-ratings of lead- 
ership ability have high correlations 
(.43 for boys and .51 for girls) with 
leadership in college. 

Student 
ways to the stereotype of the leader: 
they are self-confident (3, 6, 8, 32), 
popular (7), sociable (16, 25, 37, 49, 
52), enthusiastic (22, 24, 30, 56) , domi- 
nant and aggressive (9, 15, 21, 50), re- 
sponsible (12, 31, 50, 51), conventional 


leaders conform in many 


(18. 39, 40, 42, 54. 55. 58). and ex- 
troverted (26, 54). 

There are some 
these findings. For example, the student 
leader’s high self-rating on Originality 

34) is not supported by the other 
measures of originality. In fact, the 
nonleader gets higher scores on origi- 
nality on certain indices (17, 27, 29). 
Moreover, the leader’s high self-ratings 
on “independence” are _ inconsistent 
with the finding that indicates that he 
is dependent and lacking in self-suffi- 


inconsistencies in 


ciency (13). 


Artistic Achiever 


Table 4 summarizes the relationships 
between the student’s attributes in high 
school and his artistic achievement in 
college. 

It is clear from Table 4 that the best 
predictor of artistic achievement in col- 
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TABLE 4 


PREDICTION OF ARTISTIC ACHIEVEMENT OVER l1-, 2-, 


Variable 


. Artistic Achievement 


(HS) 


. Expressiveness 

. Originality 

. Creative Activities (A) 
. Physical Activity (B 

. Emotionality 

. Aviator 

. Masculinity-Femininity 
. Dominance 

. Leadership 


. Physicist 


2. Conformity 


. Status 

. Dominance 

. Sales Manager 

. Aggressiveness 

. SAT-Verbal 

. Accountant 

. Self-Confidence 

. Conservatism 

. Practical Mindedness 
. Capacity for Status 


Scientific Achievement 


(HS) 


. Self-Evaluation 


5. Cheerfulness 


. Musician 
. Sociability 
. High School Grades 


(HER) 


. Sense of Destiny 

. Social Science Teacher 
. Self-Acceptance 

. Tolerance for 


Ambiguity 


. Breadth of Interest 

. Complexity-Simplicity 
. SAT-Math 

. College Grades (HPR) 
. Social Presence 

. Responsibility 

. Sociability 

. Independence 

. Self-Understanding 

. M Introverted 

. E Dominance 


. Verbal Activity 


. Drive to Achieve 


Source 


3. 


, 


AND 4-YEAR PERIODS 


Time interval 





l year 


Boys Girls 





Self-Rating 
Self-Rating 


VPI 

VPI 

SVIB 
SVIB 

CPI 
Self-Rating 
SVIB 

VPI 

VPI 

NMSS 
SVIB 
Self-Rating 


SVIB 
Self-Rating 
Self-Rating 
Self-Rating 
CPI 


Self-Rating 
SVIB 
CPI 


DRS 
SVIB 
CPI 


NMSS 


OPI 


CPI 

VPI 
Self-Rating 
Self-Rating 
Self-Rating 
16 PF 

16 PF 

VPI 
Self-Rating 


31 


36 


Boys Girls 


35 
26 


29 


) years 


3 years 


c 4lc c 38c 


18 


Bow Girls Boys Girls 


28c 
30c 28c 
30c 
29c 


—25c—40c 


4 years 


24c 


3ic 
44c 


l6c 


—25 
—24 


—21 


20 


—18 


2lc 2ic 


—}7 


18 


18c 20c 


—20 


15c 
—10c 


—19c 


18c 18c 
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TABLE 4 


Source 


Self-Rating 
VPI 

VPI 

Teacher Ratin 


Athletic Ability 
Intellectuality 
Control 
Maturity 
Psychological 
Mindedness 
Occupational Level 
Barron Originality 
Persistence 
Hobby (duration of) 
Creative Activities 
Self-Control 
Psychopathic Deviate 
Popularity 
Verbal Activity 
B 


CPI 
SVIB 
OPI 
NMSS 


B 

CPI 

OPI 
Self-Rating 


“4 


wou oi ¢ 


ioe) 


PI 
PI 
PI 


> 


59 B 


60. Conformity 


61. Communality 


lege is similar achievement in high 
school. Both concurrent*® and predic- 
tive relationships for high school and 
collegiate artistic achievement approxi- 
mate a correlation of .30. High self- 
ratings on originality and participation 
in creative activities in high school pre- 
dict artistic achievement in college with 
similar efficiency. 

The artistic achiever possesses some 


of the stereotypic traits: originality, tol- 


erance for ambiguity, complexity of out- 
look, and a preference for creative ac- 
tivities (3, 4. 32. 34, 52. 55). The 
artistic achiever likes artistic vocations 
and dislikes occupations in science and 
the skilled trades (5, 6, 7, 11, 26 

The student artist resembles the stu- 
dent leader in his preference for voca- 
tions of a social character (30, 44, 59 
and in his high aspirations 3, SI 


$“Concurrent” refers here to the fact that, 
in certain samples, the student reported his 
high school achievements at the same time 
he reported his college achievements. The 
similarity between this concurrent correla- 
tion and the equivalent predictive one sug- 
gests that such retrospective reporting of high 
school achievements is fairly accurate. 
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Time interval 


1 vear 2 years 3 years 4 years 


Boys Girls Boys Girls Boys Girls Boys Girls 


—12¢ 


—12c 


g 


Is SOC iable 
19, 24, 29 


the student leader, he 
37, 38), 
expressive 
16, 43), though these qualities are 
less typical of the student artist than of 


Like 
27, 

31 

(9 


self-confident 


2. 25). and dominant 


the student leader. 


Scientific Athieve 


Table 5 summarizes the relationships 
between the student’s attributes in high 
school and his scientific achievement in 
college. 

Table 5 shows that the best predictor 
of scientific achievement in college is 
similar achievement in high school. The 
remaining predictors which are repli- 
cated or quasi-replicated are preferences 
for scientific and skilled occupations (2, 
14, 17), high self and teacher ratings of 
20, 24 


activities 


. engagement in cre- 
ative ia. on 
matical aptitude (7 
The paucity of findings in Table 5 
may be due to the small number of sci- 
entific achievements the 
checklist and to the infrequency of 
such achievements at the college level. 
On 


originality 


and mathe- 


included on 


common-sense grounds, we would 
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TABLE 5 


PREDICTION OF SCIENTIFIC ACHIEVEMENT OVER 1-, 2-, 3-, AND 4-YEAR PERIODS 


Variable Source 


Scientific Achievement 
(HS) 
. Physical Activity 
. Play 
. Status 
. Artistic Achievement 
(HS) 
. Social Leadership 
. SAT-Math 
. F Cheerful 
. College Grades (HPR) 
. Psychological 
Mindedness 
. Creative Activities (A) 
. Creative Activities (B) 
3. Scholarship 
. Physicist 
. Drive to Achieve 
. Social Science Teacher 
. Aviator 
. Breadth of Interest 
(HS) 
19. Verbal Activity 
20. Originality 
21. Emotionality 
22. Self-Confidence 
23. Accountant 
24. Originality 
25. Curious 
26. Dependable 


VPI 
NMSS 
VPI 


16 PF 


CPI 


Self-Rating 
SVIB 
Self-Rating 
SVIB 
SVIB 


VPI 
Self-Rating 
VPI 
Self-Rating 
SVIB 


Teacher Rating 
Fathers Values 
Fathers Values 


Time interval 


> 


2 years 4 years 


l year 


Boys Girls 


Boys Girls 


Boys Girls Boys Girls 
l6c 24c 
22c 18c 


20c 25c 


Teacher Rating 


12 
12 


—16 





probably not expect to find scientific 
achievement occurring with much fre- 
quency at the undergraduate level. 


Academic Achiever 


Table 6 summarizes the relationships 
between the student’s attributes in high 
school and his academic achievement in 
college. 

According to Table 6, the best pre- 
dictors of academic achievement in col- 
lege are good grades in high school and 
high self-ratings of scholarship. Most 
of the remaining predictors can be ar- 


ranged in eight clusters which are in- 
ternally consistent and related psycho- 
logically: self-control, persistence, and 
deferred gratification (5, 14, 21, 27, 
32) ; socialization and super-ego (7, 15, 
16, 26); unsociableness and lack of 
social presence (12, 29, 34) ; femininity 
(11, 67) ; passivity and timidity (37, 71, 
74); lack of potential for original be- 
havior (8, 28, 36, 73) ; restrictive paren- 
tal attitudes (30, 19, 20, 52, 53, 51, 77) ; 
and scholastic aptitudes, verbal and 
mathematical (23, 24). The academic 
achiever then can be characterized as a 
person who got high grades in high 
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TABLE 6 


PREDICTION OF ACADEMIC ACHIEVEMENT 


College grades (HPR) 


Variable Source Boys Girls 


1. Scholarship Self-Rating 
2. Drive to Achieve Teacher Rating 
3. High School Grades (HSR 
4. Drive to Achieve Self-Rating 
5. Perseverance Self-Rating 
. Self-Evaluation Self-Rating 
. Citizenship Teacher Rating 
. Originality Teacher Rating 
. Achievement via Conformance CPI 
. Femininity CPI 
. Social Presence CPI 
. Athletic Ability Self-Rating 
3. Deferred Gratification NMSS 
. Super Ego NMSS 
5. Responsibility CPI 
5. Intrusiveness PARI 
. Acceleration of Development PARI 
. Persistence NMSS 
. Math Aptitude SAT 
. Verbal Aptitude SAT 
. Socialization CPI 
. Self-Control CPI 
. Complexity-Simplicity 
4. Sociability CPI 
. Curious Fathers Values 
>. Status VPI 
. Control VPI 
. F Cheerful 16 PF 
Capacity for Status CPI 
. Q, Radical 16 PF 
. E Dominance 16 PF 
2. Achievement via Independence CPI 
G Persistent 16 PF 
34. Cheerfulness Self-Rating 
35. Independence of Judgment 
36. Artistic Achievement (HS 
. Intellectuality VPI 
. Seclusion of Mother PARI 
Excluding Outside Influences PARI 
. Ascendency of Mother PARI 
. Self-Acceptance CPI 
. Q,; Controlled 16 PF 
3. Introversion NMSS 
. Responsibility VPI 
5. N Shrewd 16 PF 
M Introverted 16 PF 
. I Effeminate 16 PF 
. Q, Self-Sufficient 16 PF 
9. Dependence NMSS 
50. Flexibility CPI 
. H Adventurous 16 PF 
52. Emotionality VPI 
3. Martyrdom PARI 
4. Dominance NMSS 
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school; conceives of himself as a good 
student; and is self-controlled, persistent, 
responsible, feminine, and passive, He 
comes from a family whose restrictive- 
ness and striving attitudes may have 
been conducive to academic achieve- 
ment with its emphasis on conformity. 
While the criteria of creative perform- 
ance are usually unrelated to academic 
performance (see Table 2), the aca- 
demic achiever has somewhat less po- 
tential for original behavior on tests of 
originality than the student who gets 
slightly lower grades. This apparent 
lack of originality may be a natural 
consequence of his parental background, 
since certain types of creative perform- 
ance may be in part a function of im- 
pulsive, undersocialized behavior. Both 
Barron (1953, 1955) and Gough (1953, 
1960) have proposed that high grades 
may be a product of oversocialization 
and original behavior a product of un- 
dersocialization, hypotheses supported 
by the present findings, which suggest 
how academic as opposed to creative 
achievement may occur. 


DISCUSSION 


Since the findings were obtained with 
samples of students having very high 
aptitudes, they cannot be directly ap- 
plied to students having less scholastic 
aptitude. The findings reported here 
are consistent, however, with those re- 
ported in previous research, which used 
similar variables on samples of less able 
students. For example, the finding that 
academic achievement is correlated 
with self-control, persistence, socializa- 
tion, and super-ego strength reiterates 
some earlier findings on academic 
achievement summarized recently by 
Gowan (1960). However, ether clus- 
ters of variables associated with aca- 
demic achievement appear to be op- 
posed to some of the “factors” in 
Gowan’s summary. Gowan reports, for 


example, that academic achievement is 
associated with “Anti-authoritarian Be- 
havior”; the present study, which uses 
subjects more homogeneous with re- 
spect to intelligence and socioeconomic 
status, suggests the opposed hypothesis. 
It is difficult to make further compari- 
sons between the present findings and 
Gowan’s interpretative summary be- 
cause of the diversity of the inventories, 
techniques, and definitions used. 

The results for undergraduate achieve- 
ment in science and art fields appear 
consistent with several earlier studies. 
Gough (1953) found that creative per- 
formance in these fields is associated 
with scientific and artistic vocational in- 
terests. Taylor, Smith, Ghiselin, and El- 
lison (1961) report that creative people 
see themselves as original, and Barron 
(1953) and MacKinnon (1959) find 
that such people get high scores on 
originality measures. 

A review of the predictors of student 
achievement reinforces axioms 
about the prediction problem. It is 
clear that the best single predictor of 
each of the four kinds of undergraduate 
achievement is similar achievement in 
high school, an impressive point when it 
is realized that the achievement scales 
checklists) for both high school and 
college are extremely brief and have 
low reliability (Holland, 1961). The 
lunited variability in high school rank 


some 


and honor point ratio also militates 
against the efficiency of HSR as a pre- 
dictor, 

The predictive as opposed to the con- 
current validity of many of the variables 
makes explicit the misleading nature of 
concurrent studies. For example, many 
of the student variables which had sig- 
nificant concurrent correlations with 
achievement in high school failed to 
predict similar achievement in college 

cf. Holland, 1961 with the findings in 
Tables 3-6). 


If we compare the predictive effi- 
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ciency of different types of variables, 
we see that, next to past achievement, 
the best predictors are self and teacher 
ratings, interest inventories, and simple 
checklists of creative activities and in- 
terests. With a few exceptions, True- 
False type personality and originality 
scales are next in order. Finally, paren- 
tal attitude scales and scholastic apti- 
tude tests are the least efficient pre- 
dictors of later achievement. Taylor 
et al. (1961), in studies of adult scien- 
tists, found that their predictors formed 
a similar hierarchy of efficiency. 


This crude ordering implies that for 


purposes of prediction we could aban- 
don our efforts to construct sophisti- 
cated inventories, and instead, concen- 


trate on securing more elaborate records 


of past achievement, and more system- 


atic reports of the student’s self-concep- 
tions, interests, and daily activities. Per- 
sonality and originality scales would 
have to be used concurrently, to give 
meaning to these new devices and in- 
ventories, since biographical and self- 
reports are frequently difficult to in- 
terpret. 

The relative success of checklists of 
past achievement as predictors of col- 
legiate achievement suggests that some 
of the 
found here could be raised if we revised 
these brief achievement scales so that 
they would cover a greater range of 
achievements. More normal 
tions should yield higher correlations. 

The intercorrelations of social and 
artistic achievement suggest that we 
need to clarify the nature of these 


low or negligible correlations 


distribu- 


achievements and to find ways of dis- 
tinguishing between them. This prob- 
lem is part of a larger one—we must 
find dimensions of 
and order these criteria more systemati- 


new achievement 
cally so that we can study them more 
thoroughly. The present criteria still 
represent only a few of the different 
kinds of achievement. 
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The purpose of this study was to determine personal and maternal 
correlates of academic achievement among Mexican-American second- 
ary school students. 4 groups (15 each) of achieving and low achiev- 
ing boys and girls were identified and equated in age, IQ grade level, 
and courses taken. Employing standard objective measures, it was 
shown that achievers manifested reliably less hostility and more social 
maturity, intellectual efficiency, and conformity to rules. Achieving 
girls and underachieving boys appear to come from strong mother- 
dominated homes. Since these findings seem meaningful with respect 
to this subculture, it is suggested that such study of achievement in 
minority groups may better clarify the nature of relationships among 


hypothesized variables 


achievement (or 
overachievement) and 
ment have tended to focus upon three 
main aspects of this problem: (a) stu- 
dent personality (McCandless, 1947; 
McClelland, Atkinson, Clark, & Lowell, 
1953; Morgan, 1952), (6) parental at- 
titudes relative to scholastic perform- 
ance (Gough, 1953; Haggard, 1957; 
Kimball, 1953), and (c) socioeconomic 
correlates of academic achievement 
(Douvan, 1956; Milner, 1951). Need- 
less to say, research findings relative to 
the general Anglo-American group of- 
tentimes have been contradictory. Not 
only have a wide variety of measuring 
instruments been used, but the group 
itself has been so heterogeneous that 


Investigations of 
underachieve- 


generalizations across regions, classes, 
school systems, ages, etc. can be made 
only as very tentative hypotheses for 
further investigation. The complexity 
of this problem with the Anglo-Ameri- 
can group has been discussed theoreti- 
cally but has often been ignored in 
actual study (Gough, 1953; Roth, 
1956). 

Research relative to possible fac- 
tors influencing academic achievement 
among minority groups is lacking even 


though many of the apparently con- 
founding variables might be more easily 
controlled and observed with such 
groups, Cultural determinants usually 
can be defined with greater accuracy; 
the socioeconomic distribution of the 
group is often quite limited; and geo- 
graphic, rural-urban location, language 
influences, etc., are easily circumscribed. 
It was thus the purpose of the present 
study to investigate relationships be- 
tween selected nonintellectual student 
variables and parental attitudes and the 


academic performance of a group of 
Mexican-American, lower-class second- 
ary school students. In addition, an ef- 


fort was made to control academic per- 
formance by employing a criterion of 
achievement on which all subjects were 
matched. 

Literature pertaining to the Mexican- 
American indicates that this group is 
characteristically low in academic 
achievement. There are, however, 
within this population many students 
who are academically successful. Jones 

1948) and Marden (1952) indicate 
that the Mexican-American group pos- 
sesses a culture steeped in custom and 
tradition with a strongly authoritarian 
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family pattern of a patriarchal form. 
Women are highly controlled but are 
the controlling agent within the home in 
regard to the children. Predominantly 
lower class in the United States, Mexi- 
from the dual 


can-Americans suffer 


frustrations of poverty and prejudice, 


possibly responding with greater hostil- 


lower-class non-Mexican 
white counterparts (Swickard & Spilka, 
1961). Verbal deficiency and other neg- 
as mobility and 


ity than theu 


ative influences sucl 


poor living conditions undoubtedly 
make school attendance and academi 
performance a formidable experience. 

On the basis of the foregoing the fol- 


lowing hypotheses were tested: 


1. Assuming that education might be 
positively valued within this group in 
the United States, especially where tra- 
also es- 


familial controls are 


teemed, it was hypothesized that achiev- 


ditional 


ing students would come from homes in 
which the mother is more dominating, 
possessive, and ignores the child less 

2. It was hypothesized that achieve- 
ment of the students coming from this 
background, would be positively related 
to: their conformity to the requirements 
of the academic situation, and the extent 
and tendency 


of their self-motivation 


toward independent action 
> = 
achievement would be positively related 


also was hypothesized that 


to intellectual efficiency and social ma- 
turity. 

+. The achieving student, it was fur- 
ther expected, would reveal less hostility 
and indications of 
proved and adaptive means of coping 


more socially ap- 


with anxiety. 


METHOD 
Subjects 


Sixty high school junior and senior stu- 
dents served as subjects. Thirty had main- 


tained grade point GPA) above 


the seventieth percentile for their class, while 


averages 


the remaining 30 students had GPAs below 
the thirtieth percentile, The former group 
Achievers, and the latter, 
The IQs of all subjects were 


was designated, 
Underachievers 
in the 90-110 range, and the groups were 
matched on IQ, age, sex, and grade. Each 
group contained 15 boys and 15 girls. Table 
1 presents these data. No significant differ- 
ences were found between sexes, classes, and 
groups on age or IQ, whereas differences in 
GPA between groups were as desired, signifi- 
cant well beyond the .01 level 


and Measure 5 


Tests 


IQ was assessed by the Otis SA Test of 
Mental Ability GPAs were 
based on grades obtained only in academic 


Gamma form 


courses which all students had in common, 
and which appeared to emphasize the verbal 
and symbol skills measured by the 
Otis 

Parental variables were evaluated by the 
1957 


usage 


modification of 
devised by 


Drews and Teahan 
the Parent Attitude 
Shoben (1949). 


subscales designed to 


Survey 
This scale consists of three 
measure dominating, 
possessive, and ignoring attitudes of mothers 
toward their offspring. 
The California Psychological 

CPI) (Gough, 1957) was used to 
following achievement 


Inventory 
assess the 


student variables 


TABLE 


MEANS AND STANDARD DEVIATIONS FOR 
Groups AND SEXES ON THE CONTRO! 
VARIABLES 


Achievers U nderachievers 


Girls Boys Girls Boys 
V 1I5)\(N I5))(N 15))(.N 15 


Variable 


Otis IQ 
M 
SD 
GPA 
M 
SD 
Age in 
months 
M 7 206.7 


SD j 6.2/ 
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by conformance (Ac scale), achievement 
through independent behavior (Ai scale), in- 
tellectual efficiency and resourcefulness (Ie 
scale), and social maturity (So scale). 

The Siegel Manifest Hostility Scale 
(MHS), a 50-item inventory developed by 
Siegel (1956), was utilized as an objective 
measure of hostility; and the Jewell Anxiety 
Adaptation Scale was selected to measure 
student adaptation to anxiety. Jewell (1959) 
developed this scale in an attempt to deter- 
mine differences in individual reaction to 
anxieties, stressing responses which are per- 
formance facilitating in contrast to behavior 
which interferes with proper performance. 


Procedure 


The tests were administered to the stu- 
dents during school hours. A code was em- 
ployed in lieu of names on the forms, and 
the students were assured that anonymity 
would be preserved and none of these re- 
sults would become part of a school record. 
The mother was chosen to represent parental 
attitudes. To help insure understanding of 
the test items, the Drews-Teahan questions 
were read to each of the mothers. All sub- 
jects were highly cooperative. 

Statistical evaluation of the obtained data 
was accomplished through use of a groups- 
by-sexes analysis of variance design (Lind- 
quist, 1953). 


RESULTS 

Parental Attitudes 

On the Drews-Teahan 
Shoben’s Parental Attitude Survey no 
significant differences were found for 
the main effects or interaction on the 
Possessive and Ignoring scales. The Sex 
x Groups interaction term for the 
Dominance scale attained significance 
at the .05 level of confidence (F = 4.28, 
df = 1/56). In order to assess this find- 
ing the means of groups and sexes were 
corrected for these main effects and 
Tukey procedure (Edwards, 1954) was 
applied. The corrected means were 33.2 
for achieving boys and underachieving 
girls and 35.7 for low achieving boys 
and high achieving girls. The differ- 
ences between these mean pairs account 


version of 


for the observed significance at the .05 
level of confidence. No significant dif- 
ferences were found between the moth- 
ers of the high and low group students 
on either the possessive or ignoring atti- 
tude scales. Hypothesis 1, namely, that 
achieving students would come from 
homes in which the mothers tend to be 
dominating, possessive, and attentive, 
was partially supported. 


Student Variables 

There were no significant differences 
between the responses of achieving and 
underachieving students on the Ai scale 
of the CPI. The groups did differ sig- 
nificantly on the three other scales of 
the CPI. On the Ac scale differences 
exceeding the .01 level of confidence 
were found between groups (F = 75.46, 
df = 1/56) , between sexes (F = 7.39, df 
= 1/56), and also for the Sex X Groups 
interaction term . F = 33.80, df = 1/56). 
Significant differences at the .05 level 
of confidence were also found between 
the two groups of students on Ie (F = 
5.41, df = 1/56) and So (F = 7.09, df 
= 1/56). Hypothesis 2 was supported for 
achievement via conformance, but not 
for self motivation and independent ac- 
tion. Hypothesis 3 is also supported in 
that the achievers appear to be both 
significantly more intellectually efficient 


and socially more mature than their un- 


derachieving classmates. 

The MHS yielded significant differ- 
ences between the groups in the direc- 
tion predicted at the .01 level of confi- 
dence (F = 13.22, df = 1/56) and also 
between sexes (F = 8.37, df = 1/56). 
Significant differences, at the .05 level 
of confidence (F = 5.73, df = 1/56) 
were found between groups on Jewell’s 
Anxiety Adaptation Scale. Hypothesis 4, 
namely, that achieving students would 
be less hostile and show more socially 
acceptable means of coping with anxi- 
ety, was thus supported. 
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TABLE 2 


MEANS AND STANDARD DEVIATIONS FOR GROUPS AND SEXES ON TEST VARIABLES 
(N per group = 15) 


Variable Achievers 


Boys 


Girls 


Mother variables 
Dominance 
Possessive 


Ignoring 


Student variables: 


Manifest Hostility 


Anxiety Adaptation 


16.: 
} 
Conformity Achievement 7 24 


2 


> 


Independent Achievement 


Intellectual Efficiency 


34 
+. 


5) 


39.9 36.0 
6.09) 


Socialization 
5.65 


Note.—Standard deviations are given in parentheses 


DISCUSSION 


It was hypothesized that achieving 
students came from more mother-domi- 
nated homes than underachievers. The 
results indicate that no significant dif- 
ferences in mother domination existed 
between groups but the Sex X Group 
interaction yielded differences which 
were significant; that is, underachieving 
boys and achieving girls come from 
homes in which the mothers are more 
dominating than are the mothers of 
achieving boys and _ underachieving 
girls. 

In view of this finding, it may be 


conjectured that restrictive and de- 


The boy, 


Significance level 


Sex & 
Group 
interaction 


Underachievers | 


Between 
sexes 


Between 


Girls Boys groups 


14.5 
5.08) 
21.7 


5.09) 


manding maternal attitudes are per- 
ceived by the academically successful 
girl as affectional concern for her. The 
daughter, therefore, can identify with 
her mother and accept maternal control 
if the traditional controls are present. 
in contrast, may view ma- 
ternal domination as an infringement 
on his independence strivings and react 
negatively to it. In this ethnic 
the woman’s role traditionally is inferior 


group 


to the male’s position, The male teen- 
ager, then, may reject maternal domina- 
tion and possibly use poor school per- 
formance as a weapon against his de- 
manding mother. 

The achieving boy who is less domi- 
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nated may perceive maternal controls 


as containing elements of reasonableness 
and affection; he, then, may accept 
his mother’s demands and values and 
do well in an academic setting. The 
girl who is less dominated possibly sees 
this lack of control on the part of her 
mother as a not-caring attitude and she, 
in turn, develops an apathetic attitude 
toward educational attainments. 

Mothers of achieving and under- 
achieving students did not differ in 
their attitudes of possessiveness and ig- 
noring the child. However the high 
scores on both of these scales in com- 
parison to middle-class, Anglo mothers’ 
responses may indicate that these terms 
are qualitatively different for Mexican- 
American families which oftentimes are 
larger, extremely poor, and with un- 
stable paternal domination. 

The achieving student apparently has 
learned to conform to rules and regula- 
tions and to work under supervision. In 
an academic setting where rapid and 
diligent work habits are highly valued, 
the achiever wins good grades. He does 
not appear to feel hostility toward au- 
thority figures, such as teachers, because 
their demands have served as means for 
reward and personal satisfaction. He 
may therefore have developed a strong 
sense of propriety through an accept- 
ance of rules and customs and thus 
learned to respond to anxiety produced 
by educational demand with perform- 
ance facilitating behavior, by doing the 
“right” things. Achieving girls appar- 
ently conform more than either achiev- 
ing boys or underachieving girls or boys. 
This is in keeping with the above find- 
ing on maternal dominance, and the 
role of the female in this ethnic group. 
The fact that the achieving girls ob- 
tained the lowest scores on the MHS 
further supports this interpretation. 

The underachiever who tends to re- 
spond to anxiety maladaptively proba- 


bly is frustrated further by the increas- 
ing demands for conforming and aca- 
demically productive behavior, The 
scholastically unsuccessful student, who 
tends to respond on a less efficient in- 
tellectual level, has difficulty accepting 
the demands of authority figures toward 
whom he feels great hostility. Unable 
to meet the demands of the school, the 
underachiever just does not perform 
scholastically, and thus earns this label. 

The lack of significant differences be- 
tween the groups on the Ai scale of the 
CPI together with the relatively low 
scores for all students may be attributed 
to the fact that independent initiative is 
not characteristic of low socioeconomic 
students. Douvan (1956) reports that 
conditions for the lower class are such 
that success and failure are less person- 
alized. The group members are familiar 
with the effects of external factors such 
as unemployment so there is little need 
to strive unless success involves some 
meaningful, material reward, The nega- 
tive effects of prejudice also could tend 
to accentuate the highly dependent po- 
sition of the Mexican-American stu- 
dent. 

The generality of these findings, par- 
ticularly those on the student variables, 
may be argued. Their meaning how- 
ever is underscored in the Group 
Sexes interaction for maternal domi- 
nance. The significance of this variable 
is especially evident in the Mexican- 
Amcrican culture, It is thus suggested 
that additional specification of group 
customs and milieu may similarly con- 
tribute to further understanding of non- 
intellectual factors associated with aca- 
demic achievement. 
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THE EFFECTS OF ANXIETY, ACHIEVEMENT MOTI- 
VATION, AND TASK IMPORTANCE UPON PER- 
FORMANCE ON AN INTELLIGENCE TEST 


LAWRENCE S. WRIGHTSMAN, JR. 
George Peabody College for Teachers 


In a study to determine how anxiety and achievement motivation 
affect performance on an intelligence test under 2 conditions of task 
importance, 234 Ss completed the MA scale, a projective measure of 
achievement motivation, and an intelligence test. 122 Ss were led to 
believe that performance on the intelligence test was very important; 
the other 112 Ss were led to believe the test was given only to collect 
norms, When the test was seen as important, anxious Ss scored signifi- 
cantly lower than nonanxious Ss, In the other condition, anxiety was 
unrelated to test performance. Ss high in internalized achievement 
motivation performed equally well on the test regardless of its pur- 
ported importance. Low achievement motivation Ss did less well 
when they thought the test was important, perhaps indicating an 
avoidance reaction to stress. 





lower threshold of emotional responsive- 
a stronger emo- 


Although much research has been 


done recently upon the effects of anx- ness and react with 
tional response than low A subjects to 
situations containing some degree of 
stress” (p. 137). A different type of 
theoretical formulation has been ad- 
vanced by Sarason and his associates 
(Ruebush, 1960; Sarason, Davidson, 
Lighthall, & Waite, 1958), who feel that 
the highly anxious person acts in a cau- 
tious, defensive, dependent way which 


often impedes his performance on a 


iety and achievement motivation, their 
influence upon performance in everyday 
testing situations has not been suffi- 
ciently studied. The purpose of this 
study is to determine how each of these 
variables influence performance on an 
intelligence test under different condi- 
tions of task importance. 

The question of whether a high de- 
gree of anxiety has a debilitating effect 
upon performance on a complex task _ test. 
has been considered by Spence (1958) 
in a statement of the theory underlying 
with the Manifest Anxiety 
Taylor, 1953). Spence 


The stronger emotional response on 
the part of high anxious subjects in a 
stressful situation has been found to in- 
terfere with their successful completion 
of a complex task such as is involved in 
learning of paired 

Spence, 1956). Likewise, highly anx- 
ious children, who are also found to be 


research 
(MA) 

presents evidence for acceptance of an 
“emotional reactivity hypothesis,” which the 
states that “high A subjects have a 


scale 
associates 


*The author wishes to express his thanks 


. vo more cautious, do less well on a prob- 
to Carolyn Ashcraft, Ruth Vineberg, and ? I 
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scoring and data analysis; to John Throne, 
who administered and scored the MA scale 
data; and to Lorene C. Quay, Norman R. 
Ellis, and Elizabeth G. French, who made 
helpful suggestions regarding the interpreta- 
tion of the data. 


lem solving task than do less anxious, 
less cautious children (Ruebush, 1960). 
The complex task selected for investiga- 
tion in this study was of a different 
type: performance on an intelligence 


test. It was assumed that by manipulat- 
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ing the importance of the test to the 
subjects the stress of the situation could 
also be manipulated. It was hypothe- 
sized that when the intelligence test was 
seen as unimportant there would be no 
relationship between level of anxiety 
and score on the intelligence test, but 
that when the test was seen as impor- 
tant the relationship would be a nega- 
tive one. 

The second variable whose effects 
were under scrutiny in this study is the 
level of achievement motivation. In one 
of the more fruitful approaches to this 
problem, McClelland and his associates 
(Clark & McClelland, 1955; Lowell, 
1952: McClelland, Atkinson, Clark, & 
Lowell, 1953) have found that in re- 
gard to simple tasks the strength of 
achievement motivation in some sub- 
jects may be experimentally aroused 
and manipulated through the use of in- 
structions which focus on the impor- 
tance or unimportance of the task. The 
question arises as to whether experi- 
mental arousal of task importance 
would be effective in influencing per- 
formance on a more complex task such 
as an intelligence test. The little evi- 
dence available on the question of the 
effects of motivational arousal upon 
performance in complex tasks is in con- 
flict (Fleishman, 1958; Wrightsman, 
1960) and comparisons between studies 
are difficult to make because of differ- 
ences in types of subjects and tasks used. 

A possible reason for previous negative 
findings may lie in a failure to consider 
the level of internalized achievement 
motivation in the subjects. Achievement 
motivation theory (McClelland et al., 
1953) would predict that the perform- 
ance of persons high in internalized 
achievement motivation would be unin- 
fluenced by external conditions such as 
the purported importance of the task, 
because their internalized standard of 
excellence would demand a highly moti- 
vated approach to any task. On the 


other hand, performance of the person 
low in internalized achievement motiva- 
tion may more clearly reflect the im- 
portance of the task, as it is defined by 
external conditions. Therefore, it was 
hypothesized that the purported im- 
portance of the task would not influence 
performance of subjects high in inter- 
nalized achievement motivation, but 
would influence the performance of 
subjects lower in internalized achieve- 
ment motivation. 


METHOD 


Normally, all freshmen at George Peabody 
College for Teachers are administered the 
American Council on Education Psychologi- 
cal Examination (ACE) and the Coopera- 
tive English Tests as a part of the freshman 
orientation program. After answering these 
instruments, the 234 freshmen (59 males and 
175 females) entering in the fall of 1959 
administered three additional tests 
These were the MA scale (Taylor, 1953 
the Test of Insight, a projective measure of 
achievement motivation devised by French 
(1958) ; and the Adaptability Test (Tiffin & 
Lawshe, 1954), a timed measure of intelli- 
gence. Before taking the last test, 122 sub- 
jects were led to believe that the results of 
this intelligence test would be very important 
to them, in that the results would follow the 
student throughout his college career (Im- 
portant Test condition). The other 112 sub- 
jects, members of the Unimportant Test con- 
dition, were led to believe that their re- 
sponses to the intelligence test were being 
collected for normative purposes only. Sub- 
jects in the two conditions were tested at the 
same time, as the manipulation of the im- 
portance of the test was done entirely through 
the use of printed instructions on the front 
of the test. After completing the intelligence 
test, subjects were asked to indicate how im- 
portant they thought it was for them to do 
well on the test, by placing a number from | 
through 10 under their name on the test 
booklet. An analysis indicated that the ma- 
nipulation was successful in that subjects in 
the Important Test condition placed signifi- 
cantly greater importance to doing well (x’ 
= 5.52, p < .02). 

No significant between-condition differ- 
ences were found on the ACE, the MA scale, 
or the achievement motivation measure, An 


were 
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estimate of the interscorer reliability of the 
achievement motivation measure was deter- 
mined by comparing the experimenter’s scor- 
ing of the responses of 30 subjects with the 
scoring done by another person trained in 
the scoring procedures, The resultant reliabil- 
ity estimate was .83. 

There were no significant sex differences 
on any of the measures. The finding that 
males and females did not differ in the extent 
of achievement motivation—a finding in con- 
tradiction to some previous research (Pierce 
& Bowman, 1959)—may be attributed to the 
use of female names on the form of the 
Test of Insight administered to female sub- 
jects. Despite the absence of sex difference in 
mean score, the relationships between MA 
scores and measures of aptitude such as the 
ACE and Adaptability Test are more nega- 
tive for females than for males. Henceforth, 
findings for each sex will be reported. 

Correlation between the MA scale and the 
achievement motivation measure was —.14. 
Despite its being significant at the .05 level, 
the relationship is of such small magnitude 
that it may be said with confidence that the 
two scales are measuring essentially different 
variables. 

In both conditions performance on the MA 
scale was significantly negatively correlated 
with ACE-L and ACE-Q scores (—.19 and 
—.14 in the Unimportant Test condition and 
—.28 and —.25 in the Important Test condi- 
tion). This confirms previous findings that in 
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college samples there is sometimes a slight, 
but significant, negative relationship between 
level of anxiety and performance on measures 
of scholastic aptitude (Spielberger, 1958). 
Those subjects who were to be in the Im- 
portant Test condition had higher ACE 
anxiety correlations than did those wh» were 
to be in the Unimportant Test condition. 
These differences (—.19 vs. —.28 and —.14 
vs. —.25) approach but do not reach the .05 
level of significance, and there is no clear-cut 
reason other than a coincidental one for the 
differences. However, the differences should 
be borne in mind and the effect of the 
manipulation should be seen as possibly in- 
creasing an already existing near-difference. 
In an effort to eliminate these differences, 
partial correlation coefficients were com- 
puted in addition to the first-order coeffi- 
cients. 


RESULTS 


As is indicated in Table 1, the cor- 
relation coefficient between MA scores 
and the Adaptability Test scores in the 
Unimportant Test condition was a non- 
significant —.06; but in the Important 
Test condition, the comparable corre- 
lation was —.37, significant at the .01 
level. The correlations for the two con- 
ditions differed significantly at the .01 


TABLE 1 


CorRELATIONS BETWEEN MA Score, Apaptasitiry Test Score, AND ACE Scores 








Unimportant Test 
condition 


Important Test 
condition 





Correlation 


Males 


(N = 32) | (N 





09 


MA and Adaptibility 


MA and Adaptability, with 
ACE-L partialed out 


MA and Adaptability, with 
ACE-Q partialed out 


MA and ACE-L 
MA and ACE-Q 
Adaptability and ACE-L 
Adaptability and ACE-Q 


24 


50** 


749 


Females 
= 80) |(N = 112) 


12 


—.18 


—,2g** 


Males 
N = 27) 


Total 
(N = 122) 


Females 


Total 


| 
| 
| 
| 
| 
| 
| 
| 


—.06 —.40** 


.00 .07 


—,24%8 


ms. .05 -—a, —29°° 

~ 19 

— 14 
.63** 


Ph aad 


_—, 328 —,28** 
—31/#* — 958 
.68** 71*# 


Nata 


sare 





** Significant at the .01 level. 
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level, using a one-tailed test. The re- 
lationship was greater in the case of the 
females than the males, although the 
difference was not statistically signifi- 
cant. 

A similar result occurs when the ef- 
fects of ACE scores are partialed out; 
performance on the Adaptability Test is 
related to MA score only under the con- 
dition that the test performance is im- 
portant. It thus appears that when per- 
formance on the intelligence test carries 
little importance, a person’s level of 
anxiety is unrelated to his performance, 
but when the task is seen as important, 
persons high in anxiety suffer impaired 
performance. Sex differences in the 
Important Test condition are reduced 
after partialing out the effects of the 
ACE test. 

Further confirmation is found 
comparison of the MA-Adaptability cor- 
relations in the upper part of Table | 
with the MA-ACE correlations in the 
lower part. Since the ACE was admin- 
istered to the subjects as a part of the 
standard freshman testing battery, it 
may be assumed that, to the Unimpor- 
tant Test subjects, it bore a level of 


in a 


TAB 


MEAN ADAPTABILITY TEesT ScorEs FoR Su 
(N in cell in 


Unimportant Test 


cond 
Subjects 
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importance greater than that attached 
to the Adaptability Test, and to the 
Important Test subjects, a level of im- 
portance Jess than that attached to the 
Adaptability Test. If this assumption is 
correct and if anxiety has a debilitating 
effect on test performance only when 
the test is considered important, differ- 
ential effects would be expected in the 
two conditions. In the Important Test 
condition, the MA-Adaptability corre- 
lation should be more negative than the 
MA-ACE correlations, to reflect the 
greater perceived importance of the 
Adaptability Test and the consequent 
deterioration of performance by anxious 
subjects. For Unimportant Tests sub- 
jects, the MA-Adaptability correlation 
should be less negative than the M4A- 
ACE correlations, as here performance 
on the test is presented as unimportant. 
This differential finding does hold, as is 
indicated in Table 1, although the dif- 
ferences between the MA-Adaptability 
and MA-ACE correlations do not reach 
a level of statistical significance. 
Another indication of effects of anx- 
iety level is revealed in Table 2. Men 
and women in each condition were di- 


LE 2 


BJECTS WITH DIFFERENT ANXIETY SCORES 
parentheses ) 


Important Test 
ition condition 





Total 


Females 


Top quarter of MA scale 
Distribution (Scores of 19-36) 


Third quarter of MA scale 
Distribution (Scores of 14-18) 


Second quarter of MA scale 
Distribution (Scores of 9-13) 


First quarter of MA scale 
Distribution (Scores of 1-8) 


Totals 


Males Females 


18.5 (23) 


17.9 (18) 


20.1 (21) 


19.5 (18) 


19.0 (80) 


Total 





19.4 (32) 


18.0 (24) 


20.4 (30) 


20.1 (26) 


19.6 (112) | 


15.0 (6) 


19.6 (10) 


23.0 (3) 


18.4 (8) 


18.6 (27) 








16.0 (21) 


18.0 (26) 


19.6 (26) 


20.5 (22) 
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vided into four quarters on the basis of 
MA score. Mean Adaptability Test 
scores for subjects in the Unimportant 
Test condition bear no relation to the 
level of anxiety score. In the Important 
Test condition, it appears that the sub- 
jects in the highest quarter of the MA 
distribution are the ones whose Adapta- 
bility scores are detrimentally influenced 
by the stress of the situation. This is 
true of both sexes, although the number 
of males in this cell is quite small. 

An analysis of variance of the data in 
Table 2 indicates that mean Adapta- 
bility scores did not differ between con- 
ditions, but that subjects higher in 
anxiety did significantly poorer. The 
findings are in keeping with the results 
of the correlational analysis presented 
above. 

To study the effects of level of 
achievement motivation, subjects were 
divided into high, medium, and low 
groups on the basis of achievement mo- 
tivation score. Table 3 presents the re- 


sults of the analysis for the second 
hypothesis. The prediction that the im- 
portance of the task would influence 
performance of low achievement moti- 
vation subjects but not influence that of 
high achievement motivation subjects 


TABLE 3 


ANALYSIS OF VARIANCE TABLE FOR 
ADAPTABILITY TeEsT SCORES 


Source of 
Variance SS dj 


Achievement 
motivation 
level 1.26 

Importance 
treatment 1.36 

4.10 


Error — 


1.36 
2.05 4.04** 
0.51 — 


Interaction 
228 
‘Note.—Analysis from Walker and Lev (1953, pp. 
381-382). 


** Significant at 


01 level 


would imply the presence of a signifi- 
cant interaction between treatments and 
levels of achievement motivation. Table 
3 indicates that the interaction was sig- 
nificant at the .01 level. However, 
neither the importance of the tasks nor 
the internalized level of achievement 
motivation in the subjects had any 
effect upon thei scores on the intelli- 
gence test. 

Table 4 presents the mean Adapta- 
bility Test scores of subjects with vary- 
ing levels of achievement motivation. 
The importance manipulation was suc- 
cessful in influencing Adaptability scores 
only for the low achievement motiva- 
tion subjects, who did poorer when the 
test was important than when it was 
unimportant, This finding occurs in the 
case of subjects of each sex, 


DISCUSSION 


The findings regarding the effects of 
anxiety confirm both the “cautiousness” 
and the “emotional reactivity” hypothe- 
ses and indicate that a stressful situation 
interferes with the successful perform- 
ance of high anxious subjects on a com- 
plex task. The contribution of this study 
may lie in the type of complex task used, 
for the effects upon performance in an 
intelligence test demonstrated in this 
study are quite similar to the effects of 
anxiety arousal on maze learning per- 
formance (Sarason, Mandler, & Craig- 
hill, 1952) and on the learning of paired 
associates (Spence, 1956). 

If the findings may be generalized to 
other testing situations, it would seem 
that the actual test performance of ex- 
tremely anxious persons is an underesti- 
mate of what they can do under more 
relaxed conditions. The performance of 
extremely anxious subjects in this study 
was reduced by almost one standard 
deviation by the stress of the instruc- 
tions. The test performance of less anx- 
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TABLE 4 


MEAN ApapTABILiTry Test Scores FoR SUBJECTS VARYING IN 
ACHIEVEMENT MOTIVATION 
N in cell in parentheses) 


Unimportant Test 


Achievement motivation 
classification 


Women 


scores of 5-14 


High 


Medium (scores of 3 and 4 


Low (scores of 0-2 


ious subjects seems unaltered by the 
stress of the situation. 

The findings regarding achievement 
motivation indicate that subjects with a 
high drive to do well try equally hard 
regardless of the task’s importance, and 
corroboration is found for past research 
indicating that college students perform 
equally well on an intelligence test 
whether the test is seen as important or 
not (Wrightsman, 1960). The attempt 
to increase level of performance through 
experimental arousal of motivation was 
apparently not successful. 

The worsened performance of low 
achievement motivation subjects when 
the task is important perhaps indicates 
an almost deliberate failure to exert full 
effort under the challenge of an im- 
portant situation, Atkinson (1958, p 
332) states that there seems to be good 


presumptive evidence that fear of fail- 


ure is greater in low achievement moti- 
vation groups. In the earlier scoring of 
the Test of Insight, responses indicating 
both positive and negative aspects of 
achievement motivation were scored to- 
The two items of the Test of 
Insight most likely. to elicit responses of 


gethe 


fear of failure in achievement were re- 
scored for this type of response alone 
These items, on the female form, were 


Number 5, “Mary Ann worries a lot 


condition 


Important Test 
condition 


Women 


19.6 (34 20.4 (30 
19.1 (44 2 18.7 (38 


20.0 (34) 16.4 (27 


tests and 
examinations” and Number 8, “Janet 
works harder than people.” 
Though much more evidence is neces- 
sary, there little 

that the low achievement 
subjects shield themselves from full out- 
put on an important task because they 
fear the outcome. 


about how she has done on 
most 


was very indication 


motivation 
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